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A FOLLOW-UP OF SOME EARLIER WIS- 
CONSIN STUDIES OF 
TEACHING ABILITY 


WILLARD JOSEPH BRANDT* 
University of Wisconsin 


Section I 


THE PROBLEM 
Introduction 


The problem of measurement and prediction 
of teaching efficiency has become increasingly 
important since the turn of the century. During 
the period from 1930 to 1940, the teacher train- 
ing institutions were graduating about 50,000 
more teachers than were needed. Some of the 
teacher training institutions were attempting to 
select the students entering the teaching pro- 
fession. School superintendents were selecting 
the individuals whom they thought would do the 
best job. Most of the selections were made on 
the basis of subjective data. 


Almost immediately following the period 
during which there was a large surplus of qual- 
ified teachers came the war period with a short- 
age of teachers. Many left the teaching pro- 
fession for more lucrative positions or were 





inducted into the armed services. The ques- 
tions quite generally asked were: Are the best 
teachers leaving the profession? Did the teach- 
ing profession improve during the war period 
by losing those who did not enjoy teaching and 
who may not have been the most successful? 
How may better teachers be recruited and ed- 
ucated for teaching? 


It has been estimated by the National Educa- 
tion Association that about 100,000 fewer elem- 
entary teachers than needed have been trained 
each year for the past two years. This short- 
age may not, however, continue for many years. 
To meet this present shortage of elementary 
teachers, the teacher training institutions have 
organized recruitment programs. What cri- 
teria should they use in selecting prospective 
teachers? 


This follow-up investigation is concerned 
with the validity and reliability of predictions 
of teaching success, made C. V. LaDuke! 
L. E. Rostker2, R. D. Jones, and L. J. Lins4. 





* The euthor wishes to express his deep envreciation to his major professor, 
Dr. A. S. Berr, for his introduction to the problem and for his advice, 


eid end encouragement when vrogress was slow. 


hie time and guidance. 


He was most generous with 


Appreciation is extended to C. V. LaDuke, L. E. 


Rostker, R. D. Jones, and L. J. Lins for their assistance in obtaining 


data to conduct this etudy. 


The author ie eleo indebted to the teachers, 


orincirals, supervisors, greduate students, and the many others who asris- 


ted in this follow-up study. 


1. GC. V. LaDuke, "The Measurement of Teaching Ability," Journal of Experi- 


mental Education, XIV (September 1945), pp. 75-100. 








2. L. E. Rostker, “The Measurement of Teaching Ability," Journal of Exper- 


imental Education, XIV (September 1945), pp. 6-51. 








3. R. D. Jones, “Prediction of Teaching Efficiency from Objective Measures," 
Journal of Experimental Education, XV (September 1946), pp. 85-99. 


4. L. J. Lins, "The Prediction of Teaching Efficiency," Journal of Experi- 


mental Education, XV (September 1946), pp. 2-60. 
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Have important changes occurred during the 
interim between the time when these investi- 
gations were conducted and the current status 
of the teachers involved in these studies? 


Statement of the Problem 





The four studies, with which this follow-up 
study is concerned, had in general the same 
objectives. They differed, however, in the 
time at which each study was conducted. Ina 
sense, the LaDuke5 and Rostker® studies con- 
stituted the background for the Jones? and Lins® 
studies. The LaDuke and Rostker studies 
attempted to discover the interrelationships of 
certain factors--after the teachers were in the 
field, which might be employed in future pre- 
diction studies. The Jones and Lins studies 
started with the people during the pre-service 
period and attempted to ascertain the validity 
and reliability of measures commonly used to 
predict teaching success in the field. All of 
these studies attempted to find the interrela- 
tionships between measures when applied 
within short intervals of time. The follow-up 
study attempts to find the interrelationships 
between measures after a lapse of eight years 
for the LaDuke study, eleven years for the 
Rostker study, and four years for the Lins and 
Jones studies. 


The purposes of this follow-up study then are: 
(1) to find the interrelationships at the present 
time between some of the measures employed in 
the original studies, (2) to determine if there 
have been any significant changes in the rela- 
tionships found between certain measures of 
teaching ability and efficiency as reported in 
the original studies, after a lapse of time, and 
(3) if the predictions of teaching success made 
in the original studies in terms of the measures 
employed are valid and reliable. 


A secondary purpose of this follow-up study is 
to ascertain whether there is any relationship 
between teaching success as measured by the 
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criterion of this follow-up study and six quali- 
tative factors obtained from a Data Booklet 
(Appendix B1).* 


Section Il 


GENERAL DESIGN OF THE STUDY 


Selection of Participating Teachers 





The teachers participating in this follow-up 
study on the measurement and prediction of 
teaching efficiency were chosen from those who 
had participated in the previous studies made by 
LaDuke9, Rostker10, Jones11, and Lins12, and 
who had continued in teaching until May, 1947. 
Of the 185 teachers who participated in the orig- 
inal studies, 80 were still teaching and indicated 
a willingness to cooperate in another study. 


A rating on the Wisconsin Adaptation of the 
M-Blank, was obtained for 57 of the 80 teachers. 
Thirty-five of the 57 teachers, for whom the 
M-Blank rating had been secured, completed 
and returned the Data Booklets. 


The information on the 35 teachers who re- 
turned the Data Booklets is analyzed separately. 


Collection of Teacher Data 





The first correspondence with the participants 
was a postal reply card (Appendix B2), which 
asked for information so that the files in the 
Teacher Personnel Research Bureau might be 
brought up-to-date. If no reply was received to 
the first card, it was followed by a letter and 
another card which asked for the same infor- 
mation. 


A follow-up card, sent to all teachers who had 
returned the first information card, asked them 
if they were willing to cooperate in this study. 
Of the 185 who had participated in one or another 
of the four previous investigations, 80 expressed 





5. LeDuke, 


Op. cit. 
Op. cit. 


6. Rostker, 
7. Jones, op. cit. 
8. Lins, op. cit. 


* All references to Appendices may be found in original thesis on file in 


University of Wisconsin Library. 
9. LaDuke, Ibid. 
10. Rostker, Ibid. 
11. Jones, Ibid. 
12. Lins, Ibid. 
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a willingness to cooperate in another study. Of 
those willing to participate in this study, nine 
were of the original 34 in the LaDuke study, 17 of 
the 28 in the Rostker study, 35 of the 65 in the 
Jones study, and 19 of the 58 in the Lins study. 


In cooperation with Miss Frances Kline and 
other graduate students enrolled in the School 
of Education at the University of Wisconsin 
d 1946-47, the original Student Data Book- 
let!? and the Supplementary Data Booklet 14 
used by Lins and Jones were revised and com- 
bined into a new Data Booklet (Appendix B1). 


Each teacher who had indicated a willingness 
to cooperate in this study was sent a copy of 
the new Data Booklet and asked to supply the 
information and return it as soon as possible. 
Of the 39 Data Booklets returned, four were 
eliminated because they were incomplete. The 
information in the Data Booklet included such 
items as age, sex, years of teaching experience, 
training beyond high school, type of teaching 
certificate, beginning teaching salary, salary in 
1947, parental occupation, work experience 
other than teaching, participation in music, dra- 
matics, community activities, reading interests 
and membership in professional organizations 
(Appendix B1). 


Criterion and Other Measures Applied to 
the Teachers in the Follow-Up Study 








The criterion of teaching efficiency used in 
this follow-up study is a supervisory rating made 
on the Wisconsin Adaptation of the M-Blank (Ap- 
pendix B5). The M-Blank was also used as a 
measurement of teaching efficiency in the orig- 
inal studies made by Lins and Jones. 


A letter of explanation and directions was sent 
with a copy of the M-Blank to the principal, sup- 
erintendent or supervisor who was responsible 
for the supervision of the individual participat- 
ing in this study. Fifty-seven of the 80 M-Blanks 
sent to the supervisors were completed and re- 
turned. 


The final evaluations on the M-Blank were 
made on a five-point scale of five for outstand- 
ing, four for above average, three for average, 
two for below average and one for poor or in- 
ferior. 


Additional ratings on the participants were 
obtained from the offices of the Teacher Place- 
ment Department of the Wisconsin Industrial 
Commission, the Teacher Placement Bureau of 
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the University of Wisconsin, and the State De- 
partment of Public Instruction. 


The evaluation sheet used by the Industrial 
Commission (Appendix B7), employed a scale 
of five categories: poor, below average, aver- 
age, above average, and outstanding. The 
Placement Bureau of the University of Wiscon- 
sin (Appendix B8) used a scale of four categor- 
ies: poor, fair, good, and excellent, with sub- 
divisions under each category. These sub- 
divisions were arbitrarily divided by the writer 
into five parts to correspond with the other 
scales. The State Department of Public Instruc- 
tion used the letters A, B, C, D, and E to rate 
the efficiency of the teachers. Since the three 
rating sheets used by the Industrial Commission, 
the University of Wisconsin, and the State De- 
partment of Public Instruction were very sim- 
ilar, the five-point scale used in the M-Blank 
was used with these. 


The evaluations made on the rating sheets 
used by the Industrial Commission, the Univer- 
sity of Wisconsin, and the State Department of 
Public Instruction were obtained from their 
records. These ratings had been made by pre- 
vious principals or supervisors, with the excep- 
tion of the State Department of Public Instruc- 
tion, whose supervisors had made the ratings. 


Ratings for 46 participants were secured on 
the Industrial Commission’s rating sheet, for 
23 teachers on the blank used by the Placement 
Bureau of the University of Wisconsin, and for 
27 teachers by the State Department of Public 
Instruction. 


The predictions of teaching efficiency obtained, 
when using the same regression equation and 
independent variables employed in the original 
studies, will be compared with the actual scores 
received on the M-Blank in this follow-up study. 


The follow-up study also attempts to ascertain 
if there are any significant relationships be- 
tween the criterion and the following qualitative 
traits: 


1. The father was or was not a farmer. 


2. The individual did or did not participate 
in athletics in high school and/or college. 


3. The individual did or did not participate 
in musical activities in high school and/or 
college. 





13. L. J. Lins, 


Poth of Teaching Efficiency, Unpublished Thesis, Univer- 
sity of Wisconsin rary, son, 1946 
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4. The individual did or did not participate 
in dramatics in high school and/or college. 


5. The individual did or did not participate 
in community activities as a teacher. 


6. The individual did or did not belong to a 
professional organization as a teacher. 


When analyzing the above named six qualitative 
factors, the information for the teachers in the 
original studies by LaDuke and Rostker was 
combined, and also the information relative to 
the Jones and Lins studies was combined. 


Although elsewhere in this study comparisons 
made between this follow-up study and the four 
preceding investigations have been separate, it 


seemed better and satisfactory to combine them 
here. 


Section IM 
ANALYSIS OF THE DATA 
Study Number One - LaDuke 





Teacher Measures 


The criterion of supervisory ratings used in 
this follow-up study was made on the Wiscon- 
sin Adaptation of the M-Blank. In addition to 
the M-Blank ratings, ratings for three teachers 
were secured on the evaluation sheet used by 
the Teacher Placement Department of the Wis- 
consin Industrial Commission. 


The correlations reported by LaDuke15 be- 
tween the supervisory rating scales used in the 
original study and the criterion of pupil gain 
employed in the original study when the rating 
was made by the county superintendent, were 
as follows: 


Criterion with Almy-Sorenson 
Rating Scale 


Criterion with Michigan Rating 
Scale 


Criterion with Torgerson Rat- 

ing Scale -00 
When the ratings of the teachers were made by 
supervising teachers, the correlations between 
the criterion of pupil gain employed in the orig- 
inal study and the supervisory ratings used in 
the original study were: 
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Criterion with Almy~-Sorenson 
Rating Scale 


Criterion with Michigan Rating 
Scale 


Criterion with Torgerson Rat- 
ing Scale 


The zero order correlation coefficient between 
the criterion of pupil gain employed in the orig- 
inal study and the criterion of supervisory rat- 
ings employed in this follow-up study, however, 
is .14 . Although this correlation between the 
criterion of pupil gain in the original study and 
the criterion of supervisory ratings in this 
follow-up study is not large enough to be statis- 
tically significant, it does seem to indicate a 
closer relationship between the measures after 
a lapse of eight years. It appears that the pre- 
dictions of teaching success made by LaDuke 
have improved rather than deteriorated over a 
period of years. 


The correlations between the supervisory 
ratings made by county superintendents in the 
original study and the supervisory ratings in 
this follow-up study, Table I, were: 


M-Blank with Almy-Sorenson 
Rating Scale 


M-Blank with Michigan Rating 
Scale 


M-Blank with Torgerson Rating 
Scale -73 


Using the ratings made by the supervising 
teachers, the correlations were: 


M-Blank with Almy-Sorenson 
Rating Scale 


M-Blank with Michigan Rating 
Scale 


M-Blank with Torgerson Rating 
Scale 


Although the correlations between the super- 
visory ratings of the original and this follow- 
up study are high, they are not statistically 
significant. 


A low correlation between pupil gain and 
supervisory ratings also agrees with the find- 





15. C. V. LaDuke, 


Education, xiv (September 1945), p. 
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ings of Barr16, who obtained coefficients of 
correlation ranging from .09 to .13 between 
supervisory ratings and pupil gain, and Gotham17 
found a correlation coefficient ranging from 
-35 to .43 using the same supervisory rating 
scales as LaDuke. 


Data Booklet Information - LaDuke Study 





A better understanding of the participants of 
this investigation can be obtained by studying 
some of the information obtained from the Data 
Booklet (Appendix B1). 


Table I shows that the number of schools in 
which the participants had taught varied from 
four to nine, with a mean of six. About three 
years was the usual period of tenure of a 
teacher in any one school. The lowest begin- 
ning annual salary was $585, and the highest 
beginning annual salary was $900, with a 
mean beginning annual salary of $772.50. In 
1947, the highest annual salary was $2475, and 
the lowest annual salary was $2025, with 
$2153.33 as the mean annual salary. It is inter- 
esting to note that the teacher who received 
the lowest beginning salary ($585) received the 
highest annual salary ($2475) in 1947, or an in- 
crease of over 425 percent, and the teacher who 
had received the highest beginning annual salary 
($900) received the lowest annual salary ($2025) 
in 1947, or an increase of only 225 percent. It 
should also be noted in Table II that it was not 
the teacher who moved most frequently who re- 
ceived the highest salary in 1947. 


In 1939, all of the participating teachers had 
county one-year licenses. In 1947, two had life 
certificates, two had five-year state licenses, 
and one had a state one-year license.18 The 
two teachers with life certificates had completed 
three additional years of schooling. 


The Pearson product-moment correlation co- 
efficient between the criterion of supervisory 
rating used in this follow-up study and the salary 
received in 1947 is .59 + .30, which is signif- 
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icant at the 19.02 percent level (Table XIX, 
Appendix A). Even though the correlation coef- 
ficient is not large enough to be statistically sig- 
nificant, it might be said that it substantiates a 
popular opinion that there is not too much rela- 
tionship between teaching efficiency and salary 
received. It also agrees with the findings of 
other previous studies. F. B. Knight19 found a 
correlation coefficient of .35 between salaries 
and teaching efficiency as rated by supervisors, 
mutual judgments of the teachers themselves 
and consensus of pupil opinions. R. L. Morton20 
obtained a correlation of coefficient of .24 be- 
tween salaries and the criterion of pupil re- 
actions in the cities studied. 


Statistical treatment of some of the infor- 
mation in the Data Booklet, to determine if 
there are any significant relationships between 
the criterion and six selected qualitative vari- 
ables, will be included in Study Number Two of 
this section. 


Study Number Two - Rostker 





Criterion and Other Teacher Measures 


The criterion of teaching ability employed in 
this part of the follow-up study was that of super- 
visory ratings. Of the 28 teachers participating 
in the original study in 1936, 19 were still 
teaching in 1947, and supervisory ratings on the 
Wisconsin Adaptation of the M-Blank were ob- 
tained for 12 of the 19. 


The correlation between the criterion of 
pupil gain used in the original study and the 
criterion of supervisory ratings used in the 
follow-up study is .35 + .27. In the original 
study by Rostker21 the coefficients of correl- 
ation between the criterion of pupil gain and 
the supervisory ratings, Table III, on three - 
different scales were: 


Pupil gain with Almy-Sorenson 


Rating Scale -.02 





16. 


A. S. Barr, “The Validity of Certain Instruments Employed in the Measure- 
ment of Teaching Ability," The Measurement of Teaching Effici (Rew 
York: Macmillan 0o., 1935), pp. 71-141. 


R. E. Gotham, "Personality and Teaching Efficiency," Journal of Exper~ 
imental- Education, XIV (December 1945), pp. 157-165. 


17. 


18. Since 1939, a state one-year license is equivalent to a county one-year 


license. 


F. B. Knight, 
to Education, 
1922). 


19. lities Related to Success in Teachi 


ew York: Teachers ege, 


, Contributions 


lo. Olumbia University, 


20. 


R. L. Morton, "Qualities of Merit in Seconda Mg oe Educational 


ry 
Administration and Supervision, V (May 1919), pp. 
Rostker, op. cit. 
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TABLE Il 


ZERO ORDER COEFFICIENTS OF CORRELATION, STANDARD ERROR OF 
CORRELATION AND LEVEL OF SIGNIFICANCE BETWEEN THREE RATING 
SCALES EMPLOYED BY ROSTKER IN HIS ORIGINAL STUDY AND THE 
M-BLANK RATING EMPLOYED IN THE FOLLOW-UP STUDY 





Percent 
Level of 
r Er Significance 


Michigan Rating Scale -27 + .28 36.82 





Torgerson Rating Scale -19 + .20 53.52 
Almy-Sorenson Rating Scale ell + .27 73.00 











TABLE IV 


COMPARISON OF PREDICTED SCORES USING SIX VARIABLES 
WITH THE ACTUAL M-BLANK SCORES IN THE FOLLOW-UP 
STUDY (ROSTKER) 





Teacher Predicted Actual M-Blank 
Number Score Score Difference 





39 3.61 2.88 -.73 
40 3.62 4.11 -49 
41 3.60 3.14 
42 3.32 4.22 
43 4.32 
44 
45 
46 
47 
48 
49 
50 
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Pupil gain with Michigan Rating 
Scale «15 
Pupil gain with Torgerson Rat- 
ing Scale -.01 
The correlation coefficients between the 
supervisory rating scales used in the original 
study and the supervisory ratings of this fol- 
low-up study are: 


M-Blank with Almy-Sorenson 

Rating Scale -ll 
M-Blank with Michigan Rating 

Scale 27 
M-Blank with Torgerson Rat- 

ing Scale 19 
Although the correlations in the above instances 
are not large enough to be statistically signifi- 
cant, the larger correlation of .35 between the 
supervisory ratings of this follow-up study and 
the criterion of pupil gain used in the original 
study shows that a closer relationship does 
seem to be present between the two measures, 
applied after a lapse of 11 years. 


Employing the general form for the multiple 
regression equation which can be written as: 


Ke = Bey S& 1 + Beg Se X2 wn + Be, Oe Xp 
on 


+ (Me- By, oc My - Beg dc Mg... 
a 2 
~ Bo, de M,) 
on 


in which X¢ indicates the criterion or dependent 
variable and X1, X2, and Xp the independent var- 
iables. The proper calculations as indicated by 
the equation give the following multiple regres- 
sion equation in raw score form: 


X, = .008 Xj + .004 Xp + .001 Xg + .359 Xq 
+ 017 X5 - .003 Xg - .89 


The scores obtained by the individual teachers 
on the six teacher measures used, and reported 
by Rostker22, were substituted for the independ- 
ent variables in the regression equation. A com- 
parison of the predicted scores and actual 
scores obtained by the teachers is found in 
Table IV. The actual scores deviate from the 
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predicted scores within the range of -.94 to +.90. 
Eight of the scores deviate from the mean by 
less than 0.5 of a point, and they all deviate by 
less than 1.0. The mean deviation between the 
predicted score and the actual score in this part 
of the follow-up study is .51, which is about the 
same as found for the Jones and Lins studies. 
which are reported in Tables ¥I, XII, XVII, and 
XVIII respectively in the follow-up study. The 
increased relationship, cited previously, be- 
tween the criterion of pupil gain in the original 
study and the criterion of this follow-up study, 
and the small mean deviation of the actual 
scores in this follow-up study from the pre- 
dicted scores when using six teacher measures 
employed by Rostker, seems to indicate that 

the predictions of teaching ability have improved 
rather than deteriorated during the period be- 
tween the two studies. 


Data Booklet Information - Rostker Study 





Information concerning the participants of the 
Rostker study23, found in Table V, shows that 
the number of schools in which the teachers 
have held positions range from three to ten. 
with a mean of about four years tenure in any 
school. Their beginning annual teaching salary 
ranged from $225 to $1500. Much of this var- 
iance is due to the greater number of years 
some have been teaching. The number of years 
in teaching ranged from 19 to 42, with a mean 
of 27.5 years. The lowest annual salary in 1947 
was $1710, while the highest was $2974. The 
mean beginning annual salary was $742.50. and 
the mean annual salary in 1947 was $2457.90. 
The individual who had taught 42 years had in- 
creased his salary about 250 percent. All of 
the participants had life certificates in 1947. 


A Pearson Product-Moment coefficient of 
correlation between the amount of salary re- 
ceived in 1947 and the criterion was .85 + .09., 
which is significant at the 00.78 percent level, 
(Table XIX, Appendix A). This seems to indi- 
cate that there may be some degree of rela- 
tionship between teaching efficiency as deter- 
mined by the criterion of this study and their 
annual salary. The significant correlation be- 
tween teaching efficiency as measured by the 
criterion of this follow-up study and salary as 
found here, however, is not in agreement with 
the findings in the sections of this follow-up 
study which are concerned with the LaDuke 
and Jones studies. As has been previously 
stated, most investigations have shown an in- 
significant relationship between salary and 
teaching efficiency. 





20 


22, L. E, Rostker, Measurement of Teachi Ability, Unrublished Thesis, 
University of Wisconsin Library, Wedteon, Wisconsin, 1941. 


23 L. E. Rostker, “Measurement of Teaching Ability," Journal of Exreri- 


mental Educetion, XIV (Sentember 1945), np. 6-51. 














06°LSbZ$ =W OS°2bL $= W 
00T? 00L Sepeih 





8T 


00TZ SepeiH ve 


0092 sepeiy 4 7 


sepein 6I 


Sepeiy 0z 


sapein ze 


sepein If 


sepery Té 


sepery TZ 
“weld 
Sepeiy 
“weld 


oe ewe: @ 2 A oe 
woe hm te mo Be m& SS & 


LZ 


S 
a 
< 
o 
> 
Q 
a 
z 
& 
: 
B 
ta 
fa, 
° 
pe 
A 
> 
ro} 





waneL [uy wsney, | ayes TOoyds Yy3}H | Buyyovay 
apery 10 | sjooyss | -11719D | puodeg Buy JO srvex 
yafqns JO *ON -UJBlyL “SI 



































(9661) AGA.LS UAMLSOU - NOLLVWYOANI UAHOVAEL 
A FIdvL 





(September 1949) 


In this part of the follow-up study, as in the 
section concerning the LaDuke investigation, 
the data obtained does not warrnat postulations 
as to whether the teachers concerned have be- 
come more or less efficient in class room in- 
struction. 


Six Qualitative Factors Combining the LaDuke 
and Rostker Studies 








Part of the information from the Data Book- 
lets which the investigator wanted to use could 
not be numerically expressed. A biserial co- 
efficient of correlation was accordingly used 
to determine the amount of relationship between 
the criterion, numerically expressed, and the 
other variable expressed in two alternative 
categories24: 

This = M2- m1 Pa 


co y 


m = mean of smaller frequency distri- 
bution 


M = mean of larger frequency distri- 
bution 


= standard deviation of total frequency 
distribution 


where: p = proportion of the total frequency 
of the larger distribution to the 
grand total frequency. 


q=1-p 


y = height of the ordinate of a 
normal curve of unit area and 
unit standard deviation at the 
point separating the area of 
the curve into proportion p 
and q. 


The processes observed in calculating biserial 
r are based on some of the properties of the 
normal probability curve. It was assumed that 
the dichotomous traits were normally distrib- 
uted and continuous, and also that the relation- 
ship between the two factors was linear. 


A biserial coefficient of correlation was cal- 
culated for the following qualitative traits: 


1. The father was or was not a farmer. 
2. The individual did or did not participate 


in athietics in high school and/or 
college. 
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3. The individual did or did not participate 
in musical activities in high school 
and/or college. 


4. The individual did or did not participate 
in dramatics in high school and/or 
college. 


5. The individual did or did not participate 
in community activities as a teacher. 


6. The individual did or did not belong to 
a professional organization as a teacher. 


The standard error of the biserial coefficients 
of correlation, the biserial correlation coeffic- 
ient and the percent level of significance is found 
in Table VI. 


An examination of Table VI shows a biserial 
coefficient of correlation of .35 + .22 between 
the criterion of the follow-up study and whether 
the father was or was not a farmer. This is 
significant at the 26.16 percent level. Even 
though the coefficient of correlation is not large 
enough to be statistically significant at the one 
or five percent level, there seems to be some 
relationship between their success as rural 
and/or elementary teachers and the fact that 
their father was a farmer. A closer analysis 
of the data in the LaDuke and Rostker studies 
shows that all the participants were elementary 
teachers, and all would be called rural if the 
sociological definition of a rural community be- 
ing one of less than 2500 population were used. 


The biserial coefficient of correlation between 
the criterion of the follow-up study and partici- 
pation in athletics in high school and/or college 
is .37 + .22, and between participation in musical 
activities is .09 + .26, neither of which is signif- 
icant at the five percent level or less. This 
would seem to indicate that there is very little 
relationship between success in teaching and 
participation in athletics or musical activities 
in high school and/or college. A negative bi- 
serial coefficient of correlation, -.47 + .20 is 
found between the criterion of this follow-up 
study and participation in community activities 
as a teacher, which is significant at the 7.04 per- 
cent level. 


Seagoe25 found a correlation of coefficient of 
-.05 between teaching success and musical 
talent, as determined by percentile rank made 
by the administrator. A zero order coefficient 
of correlation of .37 between personal data and 





24. T. OC. McCormick, 
Book Co., 1941), p. ° 


lementary Social Statistics (New York: McGraw-Hill 


25. M. V. Seagoe, "Prediction of In-Service Success in Teaching," Journal 
of Educational Research, XXXIX (May 1946), pp. 658-663. 
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TABLE VI 


BISERIAL COEFFICIENTS OF CORRELATION, STANDARD ERROR OF 
CORRELATION AND LEVEL OF SIGNIFICANCE BETWEEN SIX DICHO- ients o! 
TOMIES AND THE CRITERION OF THE FOLLOW-UP STUDY (LADUKE and ant 
AND ROSTKER STUDIES COMBINED) coeffic 
visory 
was for 


not gre 
Percent Level follow- 


Dichotomy of Significance 











Father was or was not 
a farmer 26.16 


Did or did not partici- 
pate in athletics in 
high school and/or 
college 


Did or did not partici- 
pate in musical activ- 
ities in high school 
and/or college 


Did or did not partici- Pe te 
pate in dramatics in stud 
high school and/or y; 
college ae: 


Did or did not partici- 
pate in community 
activities as a teacher 


Did or did not belong to 
a professional organ- 
ization 











N = 16 


TABLE VII 


ZERO ORDER COEFFICIENTS OF CORRELATION, STANDARD ERROR OF 
THE CORRELATION, AND SIGNIFICANCE LEVEL OF THE CORRELA- 
TION BETWEEN THE INCREASE IN SALARY SINCE BEGINNING 
TEACHING AND THE NUMBER OF TEACHING POSITIONS HELD 





Percent Level 
Er of Significance 


91.24 





3.00 


73.38 
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musical talent was found by Durlinger26. Bos- 
sing27 and Boyce28 found correlation coeffic- 
ients of .06 and .57 between supervisory rating 
and attitude and interest in the community. A 
coefficient of correlation of .41 between super- 
visory ratings and extra-curricular activities 
was found by Somers29. These results are 

not greatly different from those found in the 
follow-up study, which would seem to indicate 
that the relationship between the factors con- 
sidered does not change significantly over a 
period of years. 


A Pearson Product-Moment coefficient of 
correlation was also calculated to see if there 
was any significant relationship between the 
increase in salary since beginning teaching 
and the number of teaching positions held. 
The coefficient of correlation, found in Table 
Vil, is .09 + .27, which is significant at the 
73.38 percent level. 


Summary and Conclusions 





As a result of this part of the follow-up study 
of the teachers who participated in the Rostker 
study, certain conclusions seem warranted: 


1. Although the correlations between the 
measures employed in the original study 
and the measures employed in this follow- 
up study are not large enough to be statis- 
tically significant, the relationship has 
become closer than it was in the original 
study. Thus, it appears that the predic- 
tions of teaching efficiency made in the 
original studies have improved during the 
lapse of time rather than deteriorated. 


The criterion used in the follow-up study 
did not show any significant change in the 
teaching efficiency of the individuals con- 
cerned since the previous studies were 
conducted in 1936 and 1939. 


The zero order coefficient of correlation 
between the criterion of pupil gain in the 
original study by Rostker and the criter- 
ion of supervisory ratings used in this 
follow-up study was not significant. 
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4. No significant coefficients of correlation 
were found between the supervisory 
ratings obtained by Rostker and the criter- 
ion of this follow-up study. 


For the LaDuke participants, there is no 
significant correlation between salary 
paid in 1947 and success as a teacher, as 
found with the M-Blank, while for the 
Rostker participants the correlation is 
highly significant. 


There seems to be no relationship between 
the number of times a teacher changes 
positions and the salary increase. 


- There seems to be no significant relation- 
ship between the six dichotomous variables 
considered and success as measured by 
the criterion. 


Study Number Three - Jones 





Criterion and Other Teacher Measures 


The primary purpose of this section of the 
follow-up study, as in the two preceding sec- 
tions, is to determine if the predictions made 


by the original investigators were reliable and 


valid and to determine if the teachers included 
in this study made any significant changes in 
their teaching efficiency during the time which 
has lapsed between the first investigation and 
the present one. 


The criterion of teaching efficiency in this 
follow-up study is supervisory ratings, which 
was also one of the criteria used by Jones in the 
original study. 


As stated in Section II, the criterion of super- 
visory ratings were secured by sending copies 
of the Wisconsin Adaptation of the M-Blank to 
the principals of the schools where the teachers 
were located in 1947. A similar technique was 
employed by Jones. 


Pearson Product-Moment coefficients of cor- 
relation between the criterion (M-Blank ratings) 
used in this follow-up study and one of the cri- 





26. G. W, Durlinger, "Scholastic Prediction in e Teachers College," 
Journal of Experimental Educetion, XI (June 1943), pn. 257-267. 





27. N. L. BSosring, 


etin, No. 
of Educetion, 1932), po. 117-133. 


. Teacher Aptitude Tests and Teacher Selection, Research 
in Higher Educetion Sulletin, No. le (Jashington, D. o.: U. S. Office 


28, A. C, Boyce, “Qualities of Merit in Secondary School Teechere," Jour- 
nal of Educational Psychology, III (March 1912), mr. 144-157. 





29. G. T, Somers, Pedagogicel Fro 


nective Teachers, Contr: 


osis: 





ons to Ed 


Feedicting the Success of Pros- 
ucation, No, 9 (New York: 


Teechers College, Columbie University, 1923). 





14 JOURNAL OF EXPERIMENTAL EDUCATION 


teria employed by Jones, namely, that of super- 
visory ratings, were calculated. The Wisconsin 
Adaptation of the M-Blank was also used by 
Jones to secure supervisory ratings. 


The coefficient of correlation between the super- 
visory ratings obtained on the M-Blank by Jones 
and the supervisory ratings obtained on the M- 
Blank in this follow-up study, as shown in Table 
VIM, is .48 + .17, which is significant at the 2.66 
percent level. This tends to substantiate further 
Lins’ findings regarding the reliability and val- 
idity of the M-Blank which, as stated in Section 
Il, Lins found significant. Whatever the M-Blank 
measures, it seems to do so with a high degree 
of reliability and after a period of years. 


A comparison of the M-Blank ratings obtained 
in this follow-up study and the M-Blank ratings 
secured by Jones is found in Table IX. It shows 
a difference of .11 in the means of the two ratings. 
Using the : for the critical ratio between 
the means:30 


M1 - M2 
» "12 Ey Es 





"Eh, * E2 


it is found that the difference is significant at the 
80.08 percent level. Accepting the one or five 
percent level of significance as being satisfac- 
tory. the change in the teachers as measured by 
the M-Blank is not significant. However, Table 
IX shows that 16 of the teachers received a 
higher rating, with only six receiving a lower 
rating. It should also be noted that three 
teachers received a rating of more than one 
point less than the rating received in the orig- 
inal study using the five-point scale, explained 
in Section II. and one teacher increased her 
rating more than one point. 


Although the mean difference between the 
ratings observed in the original study and rat- 
ings obtained in this follow-up study is not 
large enough to be significant, it does show a 
tendency for most of the teachers to be rated 
higher now after a period of years. Assuming 
that this gain is not due to the ‘‘halo effect,’’ it 
does seem to indicate improvement in teaching 
ability. 


Coefficients of correlation were also calcu- 
lated between the ratings secured on the eval- 
uation sheets used by the Teacher Placement 
Department of the Wisconsin Industrial Com- 
mission (Appendix B7); the Placement Bureau 
of the University of Wisconsin (Appendix B8); 
the State Department of Public Instruction 
(Appendix B6); and the criterion of this follow- 
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up study. 


The coefficient of correlation between the 
Industrial Commission blank and the criterion 
of this follow-up study, as shown in Table VIII, 
is .53 + .19, which is significant at the 4.00 per- 
cent level. This would indicate that in only 
four samples in 100 might one expect to get 
a coefficient of this size by chance. The two 
rating sheets in this part of the study seem to 
be measuring to a degree the same qualities 
of teaching. Although the coefficients of cor- 
relation between the criterion of this follow-up 
study and the Rating Scale of the University of 
Wisconsin and the State Department of Public 
Instruction are .45 + .21 and .65 + .21, respec- 
tively, they are not significant. This lack of 
significant relationship with the criteria may 
be because different individuals made the 
ratings or that the rating sheets measure 
teaching qualities other than those measured 
by the M-Blank. 


Data Booklet Information - Jones Study 





Additional qualitative and quantitative data 
for 12 of the 22 teachers participating in this 
section of the study was obtained from the Data 
Booklet (Appendix B1). An observation of Table 
X, which has some of the data summarized, 
shows that the participants had taught a mean 
of 5.6 years, with a range varying from four to 
11 years. Teacher Number 38 had taught pre- 
vious to obtaining her degree from the Univer- 
sity of Wisconsin in 1942. During the same 
period of years, the number of schools taught 
in was 2.5. The range was one to seven. The 
lowest annual beginning salary was $1000, and 
the highest beginning annual salary was $1800, 


In 1947, the lowest annual salary was $$1920, 
and the highest annual salary was $2920, with 
$2356.66 as the mean salary; thus, the mean 
gain in salary from 1942 to 1947 was nearly 
$1100. 


The Pearson Product-Moment coefficient of 
correlation between the number of teaching 
positions held and increase in salary is .66 + 
-17, and is significant at the 3.00 percent 
level, Table VII. A correlation of this size 
would be obtained by chance not more than 
three times in a hundred, which leads to the 
conclusion that the teacher who changes pos- 
itions most frequently will increase her salary 
the most. It does not mean, however, that she 
will get the highest salary because she may 
have started at a much lower figure. 





The coefficient of correlation between the 


with a mean beginning annual salary of $1269.58. 





30. C. C. Peters and 7. R. VanVoorhis, 
Mathematicel Bases (New York: MoGraw- 
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TABLE Vill 


ZERO ORDER COEFFICIENTS OF CORRELATION, STANDARD ERROR OF 
THE CORRELATION AND SIGNIFICANCE LEVEL OF THE CORRELA- 
TION BETWEEN THE CRITERION EMPLOYED IN THIS FOLLOW- 
UP STUDY AND THE OTHER RATING SCALES USED (JONES STUDY) 





Percent Level 


Er of Significance 





Jones M-Blank 
Rating 


Industrial Com- 
mission Scale 


University of Wis- 
consin Rating 
Scale 


State Department of 
Public Instruction 








AB + .17 2.66 


53+ .19 4.04 








salary received in 1947 and the criterion of 
this follow-up study is .38 + .26, which is sig- 
nificant at the 20.78 percent level (Table XIX, 
Appendix A). Such a low correlation indicates 
that the best teachers, as rated by the criter- 
ion, do not necessarily receive the largest 
salary. It would also seem to indicate that 
the criterion measures different teaching 
qualities from those considered by the super- 
intendent when granting salary increases. The 
low correlation between salary and teaching 
efficiency found here is in agreement with the 
findings of Knight31 and Morton32. 


Statistical treatment of six dichotomous var- 
iables obtained from the Data Booklet will be 
found in Study Number Four of this section. 


Prediction of Teaching Success - Jones Study33 





The purpose of this section of the follow-up 
study differs in part from the other sections of 
the study previously discussed. The primary 
purpose of this section of the study is to ascer- 
tain if the predictions of teaching success made 
by Jones in the original study were valid and 
reliable. 


From the several factors that were used by 
Jones in the original study for prediction of 
teaching success, three were selected for use 
in this follow-up study; namely, (1) freshman- 
sophomore grade point average, (2) junior- 





jsenior grade point average, and (3) the Bell 
\Adjustment Inventory. The multiple regression 
equation using the above named factors gave a 
multiple correlation of .45, which, however, was 
not large enough to be statistically significant. 
The three above-listed variables were selected 
for use in this follow-up study because of the 
availability of the data needed. 


The general form for the multiple regression 
equation was employed and the proper calcula- 
tions as indicated by the equation give the fol- 
lowing multiple regression equation in raw 
score form: 


X, = -303 Xj + .417 Xp - .004 Xg + 2.28 


The scores obtained by the teachers, as found 
by Jones, for the three variables that were em- 
ployed for prediction in this study, were sub- 
stituted for the independent variables in the 
multiple regression equation. 


The predicted scores obtained, with the ac- 
tual scores obtained in this follow-up study, 
are found in Table XI. This shows a range in 
the deviations of the actual score of -1.04 to 
-86, with a mean deviation--disregarding signs 
when adding--of .48. 


A comparison of the predicted scores, with 
the actual score obtained by Jones in the orig- 
inal study, is found in Table XII. This shows 





$1. Knight, op. cit. 


° 
Or. cit. 


32, Morton, 


33. Jones, op. cit. 





JOURNAL OF EXPERIMENTAL EDUCATION 


TABLE Ix 


A COMPARISON OF M-BLANK RATINGS IN JONES’S ORIGINAL 
STUDY AND THE M-BLANK RATINGS IN THIS FOLLOW-UP STUDY 








M-Blank Rating M-Blank Rating 
Follow-up Study Jones Study Difference 


3.39 3.00 





4.27 3.75 
4.25 3.63 
4.61 4.00 
4.33 
4.44 
4.33 
4.55 
4.00 
3.94 
3.72 
3.66 
2.88 
4.39 
3.00 
3.77 
4.28 
3.61 
4.22 
3.33 


3.67 








4.11 


M = 3.94 
o= AT 
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TABLE XI 


COMPARISON OF THE PREDICTED SCORES OF TEACHING EFFIC- 
IENCY USING THREE VARIABLES* AND THE ACTUAL SCORES OF 
TEACHING SUCCESS AS MEASURED BY THE M-BLANK IN THIS 


FOLLOW-UP STUDY (JONES) 




















Teacher Predicted Actual Score 
Number Score in Follow-Up Difference 
13 3.93 3.39 . -.54 
14 3.41 4.27 86 
15 3.90 4.25 35 
16 3.86 4.61 75 
18 3.76 4.44 68 
19 4.13 4.33 -20 
20 4.13 4.55 42 
24 3.67 3.66 -.01 
26 3.62 4.39 17 
27 4.04 3.00 ~1.04 
30 3.64 3.61 -.03 
33 3.51 3.67 -16 
M = .48 





*Three Variables: 


1, Freshman-Sophomore Grade Point Average 
2. Junior-Senior Grade Point Average 
3. Bell Adjustment Inventory 
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a range in the deviations of the actual scores 
from the predicted scores of -1.79 to .11. The 
mean deviation between predicted and actual 
scores in the original study--disregarding 
signs when adding--is .51. The smaller mean 
deviation between the predicted and actual 
scores in this follow-up study than in the orig- 
inal study indicates a general improvement in 
the predictions after a period of years. Also, 
using the same rating scale in the original and 
this follow-up study to obtain the supervisory 
ratings, the actual scores of all except one of 
the teachers considered has improved. 


In view of these results, it would seem that 
prediction of teaching success, using the three 
variables employed in the original and this 
follow-up study, is quite reliable after a period 
of years, and there also appears to be a tend- 
ency for the predictions of teaching efficiency 
to improve after a period of years. 


Summary and Conclusion 





This section of the follow-up study was pri- 
marily concerned with the reliability of the pre- 
dictions of teaching efficiency as made in the 
original study and any significant change in 
teaching efficiency that has occurred during the 
lapse of time between the two studies. A secon- 
dary problem was to ascertain of certain selec- 
ted variables, which are listed in Section I, have 
any relationship with teaching success. 


The following conclusions seem to be war- 
ranted from the data obtained in the investi- 
gation: 


1. The predictions of teaching efficiency 
made in the original study have improved 
rather than deteriorated during the lapse 
of years between the original study and 
this follow-up study. 


2. This study has not found a significant 
change in the efficiency of teaching dur- 
ing the years between the two investi- 
gations. However, there is a tendency 
for the teachers to receive a higher 
rating in this follow-up study than they 
did in the original study. 


3. There is a significant relationship be- 
tween the M-Blank rating obtained in 
the Jones study and the M-Blank rating 
obtained in this follow-up study. 


4. There is a significant relationship be- 
tween the M-Blank rating and the rating 
on the Industrial Commission blank. 


5. A significant relationship was not found 
between the criterion of this follow-up 
study and the ratings made on the blank 
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employed by the University of Wisconsin 
and the Wisconsin Industrial Commission. 


. A significant relationship was found in 
this section between the number of 
teaching positions held and salary in- 
creases. 


Study Number Four - Lins 





Teacher Measures 


As 1n the three preceding sections of this 
follow-up study, the primary purpose of this 
section is to determine if the predictions of 
teaching success made by Lins in the original 
study have improved or deteriorated during the 
years between the two studies and also to de- 
termine if there has been a significant change 
in teaching efficiency during the same period 
of time. A secondary purpose is to determine 
if there is any relationship between teaching 
success and the six variables mentioned pre- 
viously in Section TI. 


The criterion for this section of the follow- 
up study is also that of supervisory ratings on 
the Wisconsin Adaptation of the M-Blank. As 
in Jones study, discussed in the previous sec- 
tion, supervisory ratings employing the M-Blank 
was one of the criteria used in the investigation 
made by Lins. The criteria of supervisory 
ratings employed by Lins, however, differed 
from the criteria of supervisory ratings of the 
Jones study and of this follow-up study in that 
it was a composite of ratings made on the 
M-Blank by five different raters. 


The Pearson Product-Moment coefficient of 
correlation between the composite M-Blank 
rating of the Lins study and the M-Blank rat- 
ing of this follow-up study is .73 + .14, which 
is significant at the 1.64 percent level. How- 
ever, a review of the Lins study shows that a 
coefficient of correlation of only .20 + .17 was 
obtained between the composite rating on the 
M-Biank in 1943 and the ratings made by the 
principals in 1945. 


Although there is a highly significant degree 
of relationship between the M-Blank ratings 
in the original study and in this follow-up study, 
the correlation between the M-Blank ratings 
obtained in the original study and the criterion 
of pupil gain was only .19. This low correla- 
tion is consistent with the results in the other 
three studies of this follow-up study, as well 


as with the findings of other studies previously 
cited. 


A comparison of the composite M-Blank 
ratings obtained by Lins and the M-Blank cri- 
terion ratings obtained for this follow-up study 
is found in Table XIII. A difference of .24 be- 





JOURNAL OF EXPERIMENTAL EDUCATION (Vol. XVIII) (Septer 


TABLE xi 


COMPARISON OF THE PREDICTED SCORES OF TEACHING EFFICIENCY 
USING THREE VARIABLES* AND THE ACTUAL SCORES ON THE M-BLANK 
IN THE ORIGINAL STUDY BY JONES 





Teacher Predicted Actual Score 
Number Score in Jones’ Study Difference 
13 3.93 3.00 -.93 





14 3.41 3.75 , 34 
15 3.90 3.63 
16 3.86 4.00 
18 3.76 4.00 
19 ° 4.00 
20 ° 4.13 
24 4.75 
26 o 4.13 
27 2.25 
30 A 3.00 
33 

















*Three Variables: 


1. Freshman-Sophomore Grade Point Average 
2. Junior-Senior Grade Point Average 
3. Bell Adjustment Inventory 
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TABLE XI 
COMPARISON OF THE TEACHER RATINGS ON THE M-BLANK IN THE 


LINS STUDY AND THE M-BLANK RATINGS IN THE PRESENT 
FOLLOW-UP STUDY 





Teacher Present M-Blank Lins M-Blank Difference Gain 
Number Rating Rating of Present 





4.11 4.25 -.13 
2.94 2.75 19 
3.89 3.50 39 
3.39 3.25 14 
3.94 4.10 -.16 
3.94 3.40 54 
4.28 4.10 
3.89 4.10 
3.56 3.75 
3.44 3.00 
4.33 











3.33 





M = 3.75 
O= .42 





TABLE XIV 


ZERO ORDER COEFFICIENTS OF CORRELATION, STANDARD ERROR, 
AND SIGNIFICANCE LEVEL BETWEEN TWO RATINGS USED IN THE 
FOLLOW-UP STUDY AND THE M-BLANK (LINS STUDY) 








Percent Level 
Rating Scale of Significance 


M-Blank Rating in the 
Lins Study 1.64 





Rating Blank Used by the 
Wisconsin Industrial Com- 
mission 
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tween the mean of the ratings in the original 
study and the mean of the follow-up study is 
found. Using the formula for the critical ratio 
between the means, it is found that the differ- 
ence between the means is significant at the 
61.02 percent level. The change as measured 
by the criterion is too small to be statistically 
significant; however, Table XTII shows that 
eight teachers have shown some improvement 
as measured by the criterion, and one of them 

‘ increased nearly one and one-half points, using 
the five-point scale explained in Section II. 
Though the data available in this follow-up 
study is not sufficient to say definitely that the 
teachers have improved since the original 
study, the criterion employed does seem to in- 
dicate a general tendency toward improvement. 


The coefficient of correlation between the 
rating blank used by the Teacher Placement 
Department of the Wisconsin Industrial Com- 
mission and the criterion is -.02 + .45, Table 
XIV. This correlation is too small to be sig- 
nificant in this case, and it also disagrees with 
the results obtained in the Jones study, in which 
a positive coefficient of correlation was ob- 
tained between the criterion of this follow-up 
study and the rating on the blank used by the 
Wisconsin Industrial Commission. 


The low correlation between the Industrial 
Commission’s blank and the criterion of this 
follow-up study. in contrast to the significant 
correlation between the previous composite 
M-Blank rating and the criterion of this follow- 
up study. may be due to the small number of 
cases and/or that different people made the 
ratings. 


Data Booklet Information 





The Data Booklet was constructed to obtain 
information which would help measure and pre- 
dict teaching success. Some of the information 
obtained from the Data Booklet relative to the 
Lins study is found in Table XV. It shows a 
mean of four years of teaching, with a mean of 
1.7 years in one teaching position. The number 
of teaching positions held varies from one to 
four. One teacher taught four years and in four 
different schools. The mean annual beginning 
salary was $1390, with $1230 as the lowest an- 
nual beginning salary and $1500 as the highest 
beginning salary. The mean salary received in 
1847 was $2114.29, with $1875 as the lowest 
annual salary and $2350 the highest annual sal- 
ary. The teacher who had moved four times in 
four years received the highest beginning an- 
nual salary and also the highest annual salary 
in 1947. 


Fhe Pearson Product-Moment coefficient of 
correlation between the amount of increase in 
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salary, Table VII, since beginning teaching and 
the number of teaching positions held is .05 + 
-41, which is significant at the 92.24 percent 
level. From this we might conclude that the 
teachers who move frequently do not increase 
their salary more than the teachers who do not 
change teaching positions. 


In an attempt to provide some evidence as to 
whether the most efficient teacher, as rated by 
the criterion, was receiving the highest salary, 
coefficients of correlation were calculated. The 
coefficients of correlation between the criter- 
ion and salary in 1947 is -.70+ .20, which is 
significant at the 00.86 percent level, (Table 
XIX, Appendix A). It appears that the better 
teachers among this group, as measured by the 
criterion, are receiving less salary than the 
poorer teachers. As has been pointed out in 
Study Number One of Section MI, previous in- 
vestigations have found a very low correlation 
coefficient between teaching efficiency and 
salaries. 


Insufficient data prevented the calculation of 
coefficients of correlation for this part of the 
study between the criterion and the blanks used 
by the Placement Bureau of the University of 
Wisconsin and the State Department of Public 
Instruction. 


Additional information obtained from the 
Data Booklet will be combined with the Jones 
study and analyzed in the following section. 


Six Qualitative Factors Combining the Jones 
and Lins Studies 








The information obtained from the Data Book- 
let and used here could not be numerically ex- 
pressed. A biserial coefficient of correlation 
was accordingly employed in determining the 
amount of relationship between the criterion of 
this follow-up study, numerically expressed, 
and the other variable expressed in two alter- 
native categories. As stated in Study Number 
Two, Section III, the process observed in cal- 
culating the biserial coefficient of correlation 
is based on some of the properties of the 
normal probability curve. It was assumed, also, 
for this section, that the dichotomous traits 
were normally distributed and continuous, and 
that the relationship between the two factors was 
linear. 


As in the combined information of the LaDuke 
and Rostker studies, a biserial coefficient of 
correlation was calculated for the following 


/qualitative traits: 


1. The father was or was not a farmer. 


2. The individual did or did not participate 
in athletics in high school and/or college. 





be 
Qa 
Zz 
a 
& 
a 


(September 1949) 





6z° PI IZ$ 


00° 06st $ 





OSéz 
SL8T 
Osé6T 
00TZ 
0002 


o0sze$ 


Oost 
OST 
00FT 
Oost 
00st 
oset 


uv 


“og 
ewoy 
yooeds 
yst[3ug 
“og 
eawoy 
“9 
ewoy 
ak | 
ewog 
eoueps 


£ 
T 


v 
v 





Areres 





Areyes 
Suyuuy3eq 





wane 
epery 10 
yefqng 





Uy Snel 
sTOOYDS 
JO “ON 








[0oyd§ 43H 
puofeg Su} 
-UfELL “SIX 




















(261) AGA.LS SNIT - NOLLVANOAN] USHOVAEL 


AX TIAVL 





JOURNAL OF EXPERIMENTAL EDUCATION 


TABLE XVI 


BISERIAL COEFFICIENTS OF CORRELATION, STANDARD ERROR OF COR- 
RELATION AND LEVEL OF SIGNIFICANCE BETWEEN SIX DICHOTOMIES 
AND THE CRITERION OF THE FOLLOW-UP STUDY (LINS AND JONES 
STUDIES COMBINED) 





Dichotomy 


Percent Level 
of Significance 





Father was or was not 
a farmer 


Did or did not participate 
in athletics in high school 
and/or college 


Did or did not participate 
in musical activities in 
high school and/or college 


Did or did not participate 
in dramatics in high 
school and/or college 


Did or did not participate 
in community activities 
as a teacher 


Did or did not belong to a 
professional organization 


4.88 





N= 18 


- The individual did or did not participate 
in musical activities in high school 
and/or college. 


- The individual did or did not participate 
in dramatics in high school and/or 
college. 


. The individual did or did not participate 
in community activities as a teacher. 


6. The individual did or did not belong to 
a professional organization as a teacher. 


The standard error of the biserial coefficients 
of correlation, the biserial coefficients of cor- 
relation, and the percent level of significance 
are found in Table XVI. 


The biserial coefficient of correlation be- 
tween the variable of whether the father was 
or was not a farmer andthe criterion is -.48 + 
-05, which is significant at the 4.88 percent 
level. Selecting as the base that the father was 





a farmer, the biserial r indicates a negative re- 
lationship between success as measured by the 
criterion of this follow-up study and the fact 
that their father was a farmer. This correla- 
tion would seem to indicate that there was less 
possibility of their success if their father was 

a farmer. The negative biserial r in this part 
of the study between the criterion of this follow- 
up study and the occupation of the father as a 
farmer does not agree with the positive correl- 
ation found for the group of teachers in the orig- 
inal LaDuke34 and Rostker35 studies. This may 
be due to the fact that the teachers of the LaDuke 
and Rostker studies were elementary teachers 
in rural state-graded and village schools, while 
the teachers of the Jones and Lins studies were 
secondary school teachers located in urban 
areas. 


A biserial correlation of -.33 + .07, which is 
significant at the 17.40 percent level, is found 
between the criterion and participating in com- 
munity activities as a teacher. The only signif- 
icant positive biserial correlation in this part 





34, LaDuke, op. cit. 
35. Rostker, op. cit. 
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of the follow-up study is found between the cri- 
terion of this follow-up study and participating 
in musical activities in high school and/or col- 
lege. The biserial correlation was .53 + .03, 
which is significant at the 2.86 percent level. 
This coefficient of correlation, however, is not 
in agreement with the findings of previous 
studies by Durlinger36, Bossing37, and Boyce38. 


The other biserial correlations between the 
criterion and the six qualitative factors fall 
between .00 + .22 and .38 + .06, with none sig- 
nificant at less than the five percent level. 


Prediction of Teaching Success - Lins Study 





The primary purpose of this section of the 
follow-up study is to determine if the predic- 
tions of teaching efficiency made by Lins in 
Study Number Two of his thesis were valid and 
reliable. 


The following three factors were among those 
used by Lins for the prediction of teaching 
success and were selected for use in the follow- 
up study: 


1. Composite of two interview ratings on 
general impressions. 


2. Rating of ability in assigned activity in 
Education 73. 


3. The final grade for practice teaching in 
their major. 


These three factors were chosen for following 
the prediction in this study because, through 
the multiple regression equation, Ro.12 was 
found to be .56 and also the beta coefficients 
which indicate the relative weights of the inde- 
pendent variables in the multiple correlation 
equations were available. 


The general form for the multiple regression 
equation, as given in Section III, Study Number 
Two, was used. 


The proper calculations as indicated by the 
equation give the following multiple regression 
equation in raw score form: 


Xo = .031 X; + 040 Xo + .020 Xq + 3.103 


The scores obtained by the teachers, as found 
by Lins, for the three variables that were em- 
ployed for prediction in this study, were sub- 
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stituted for the independent variables in the 
multiple regression equation. 


The predicted scores obtained, as contrasted 
with the actual scores obtained in this follow- 
up study, are found in Table XVII. For five of 
the teachers, the predicted score deviates 
from the actual score, as measured by the cri- 
terion of this follow-up study, less than 0.5 of 
a point, and all of them deviate by less than 1.0. 


A further comparison of the predictions is 
made in Table XVIII where the predicted score 
is compared with the actual score obtained on 
the M-Blank in the Lins study. It can be seen 
that only three of the predicted scores deviate 
from the actual by less than 0.5 and all scores 
deviate less than 1.0. 


The mean deviation between the predicted 
score and the actual score in this follow-up 
study is .44, while the mean deviation between 
the predicted and actual score in the Lins 
study is .59. This would seem to indicate that 
the predictions using the above stated variables 
were quite reliable and also that the predictions 
have improved during the several years between 
the two studies. Although the difference be- 
tween the original and present rating is not sta- 
tistically significant, there appears to be some 
improvement in the teaching efficiency of the 
participating teachers. 


Section IV 


SUMMARY AND CONCLUSIONS 


This follow-up study has attempted to deter- 
mine the validity and reliability of the predic- 
tions of teaching efficiency made in the orig- 
inal studies and has also attempted to ascer- 
tain if any significant changes have occurred 
in the teachers during the varying number of 
years between the original studies and the 
present study. 


Since this investigation has made an indepen- 
dent analysis of most of the data for each of the 
previous studies, the summaries will also be 
made separately. 


LaDuke Study39 


The information obtained for this part of the 
follow-up study seems to warrant the follow- 
ing conclusions: 


1. The closer relationship between the 





36.. Durlinger, op. cit. 


37, Bossing, Op. cit. 


38. Boyce, op. cit. 


39. LaDuke, op. cit. 
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TABLE Xvi 


A COMPARISON OF PREDICTED SCORES OF TEACHING EFFICIENCY 
USING THREE VARIABLES* AND THE ACTUAL SCORES SHOWING 
TEACHING SUCCESS AS MEASURED BY THE M-BLANK IN THIS 
FOLLOW-UP STUDY (LINS STUDY) 





Predicted Actual 
Scores Scores Difference 


3.43 4.11 -68 





3.43 2.94 ~.48 
3.38 51 
3.40 
3.46 -49 
3.50 -78 
14 
85 











~.02 





M = .44 


*1. Composite of two interview ratings on general impressions. 
2. Rating of ability in assigned activity in Education 73. 
3. Final grade for practice teaching in major. 


** Teachers 6, 8, and 10 omitted because of lack of data. 
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TABLE XVI 


A COMPARISON OF THE PREDICTED SCORE IN THE LINS STUDY 
USING THREE VARIABLES AND THE M-BLANK RATING IN THE 
LINS STUDY 





Actual 
Score 





4.25 
2.75 
3.50 
3.25 
4.10 
4.10 
3.75 
4.00 
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measures employed in the original study 
and the measures employed in the follow- 
up study after a period of eight years, 
seems to indicate an improvement in the 
predictions of teaching efficiency. 


. There appeared to be no significant change 
in the teaching efficiency of the teachers 
during the years between the two studies. 


- The M-Blank ratings in this follow-up 
Study are not significantly correlated 
with the ratings obtained on the super- 
visory rating scales employed by LaDuke. 
The correlation coefficients between the 
criterion of this follow-up study and the 
ratings by the county superintendents, 
using the Michigan Rating Scale, Torger- 
son Rating Scale. and the Almy-Sorenson 
Scale, were .77. .73, and .81 respectively; 
the correlation coefficients between the 
criterion and supervising teachers’ rat- 
ings. using the above listed rating scales, 
were .42. .46, and .71, respectively, none 
of which are significant at the five per- 
cent level or less. However. the rela- 
tionship is closer in the follow-up study 
than in the original study, which would 
seem to indicate an improvement in the 
predictions of teaching success. 


- The correlation between the M-Blank rat- 
ings in the follow-up study and the cri- 
terion of pupil gain used by LaDuke was 
-14 + .44, which is not large enough to be 
significant. 


. When the information of the LaDuke and 
Rostker studies is combined. there seems 
to be a positive relationship between tea- 
ching success as measured by the criter- 
ion of this follow-up study and the fact 
that their father was a farmer (r = .36). 


. The correlation coefficient between the 
criterion of this follow-up study and in- 
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terion of the follow-up study and the low 
mean deviation between the predicted and 
actual scores makes it appear that the 
predictions of teaching ability made in the 
original study, after a lapse of eleven 
years have improved rather than deter- 
iorated. 


2. The information obtained does not show 
a significant change in teaching efficiency 
during the time between the two studies. 


3. The correlation between the criterion of 
this follow-up study (M-Blank) and the 
criterion of the original study (pupil gain) 
is .35 + .27, which is not large enough so 
that it cannot be attributed to chance. 


4. The correlation coefficients of .27, .19 and 
-11 between the criterion of this follow-up 
study and the supervisory ratings in the 
original study are not large enough to be 
significant, but they do show a closer re- 
lationship between the two measures than 
the correlations of -.02, .15, and -.01 ob- 
tained by Rostker. 


5. A rural elementary teacher is more likely 
to attain success as measured by the cri- 
terion if her father is a farmer (biserial 

= .36). 


6. There is a negative relationship between 
the teaching suiccess as measured by the 
criterion and participation in community 
activities. The correlation coefficient is 
-.47. 


7. A significant correlation coefficient of 
-85 is found between teaching success as 
measured by the criterion and the salary 
received in 1947. 


Jones Study4! 


The analysis of the data for this part of the 
study seems to warrant the following conclusions: 


crease in salary was .09, which is not 
statistically significant. 


- Participation in community activities 1. Although the difference between the means 


shows a negative relationship with the 
criterion of this part of the follow-up 
study (r = .48). 


Rostker Study40 


of the M-Blank ratings in the original study 
and the M-Blank ratings in the follow-up 
study is not large enough to be significant, 
it does indicate a positive gain and a tend- 
ency for the teachers to improve in teach- 


ing efficiency as measured by the criter- 
The following conclusions seem warranted ion. 
for this part of the follow-up study: 
2. The predictions of teaching efficiency 
made by Jones, using the three variables 
employed in the follow-up study, have im- 


1. The closer relationship between the cri- 
terion of the original study and the cri- 








40. Rostker, op. cit. 


41. Jones, op. cit. 
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proved after a lapse of four years. 


3. The correlation coefficient between the 
supervisory ratings in the original study 
and the supervisory ratings in the pres- 
ent study is .48, which is significant at 
the 2.66 percent level. 


4. Of the three other supervisory rating 
scales employed, only the one used by 
the Industrial Commission of Wisconsin 
shows a significant relationship between 
it and the criterion of the follow-up study. 


5. The correlation coefficient of .66 between 
increase in salary since beginning teach- 
ing and the criterion is large enough to be 
significant at the three percent level. 


6. There is not a significant relationship 
between the criterion of this follow-up 
study and the salary received in 1947 
(r = .38). 


Lins Study 42 


For this part of the follow-up study, the fol- 
lowing conclusions seem to be warranted: 


1. The mean deviation between the predic- 
ted scores of teaching efficiency and the 
actual scores of teaching efficiency in 
the follow-up study is less than the mean 
deviations between the predicted and 
actual scores in the original study. From 
this, it appears that the predictions of 
teaching success have improved rather 
than deteriorated during the four years 
between the two studies. 


Although the difference between the 
means of the criterion of this follow-up 
study and the criterion of a composite 

of supervisory ratings in the original 
study is not large enough to be signifi- 
cant, it seems to indicate a tendency 
toward improvement in teaching effic- 
iency during the period of four years be- 
tween the two studies. 
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3. The correlation coefficient of .73 between 
the criterion of this follow-up study, which 
is a rating by one person on the M-Blank, 
and the composite of five different ratings 
by five different people on the M-Blank 
for the original study is large enough to 
be significant at the.1.64 percent level. 
This would seem to indicate a significant 
degree of reliability for the M-Blank. 


. There is a significant correlation between 
the criterion of this follow-up study and 
salary received in 1947. 


. A negative biserial r of -.48 between the 
criterion and whether the teacher’s 
father was or was not a farmer seems to 
indicate that the teachers of the Lins and 
Jones studies have greater possibilities 
of succeeding if their fathers are not 
farmers. 


. For the teachers of the Lins and Jones 
studies, there appears to be significant 
relationship between participation in 
musical activities and success as meas- 
ured by the criterion of this follow-up 
study. 


In general it seems that the predictions of 
teaching efficiency made in each of the four 
studies, which are a part of this follow-up 
study, have improved during the lapse of a 
varying number of years between the original 
studies and this follow-up study. It also ap- 
pears that there has been a slight improvement 
in the teaching efficiency of the groups partici- 
pating in the original studies, but has not been 
enough to be statistically significant. Even if the 
several probabilities are combined, the predic- 
tions for the four original studies are not statis- 
tically significant. 


The relationship between the criterion of pupil 
gain and supervisory ratings was not significant 
in any instance and the relationship between su- 
pervisory ratings in general, was not significant. 


The relationship between other factors consid- 
ered was not consistent enough to set a pattern. 





42. Lins, op. cit. 








A STUDY OF SOME FACTORS IN 
PUPIL GROWTH 


KENNETH PETER RIESCH* 


Mission House College 
Plymouth, Wisconsin 


SECTION I 
The Problem 


1.1 Introduction 


Educators continue to be interested in factors 
in pupil growth. Many studies have been made 
of pupil growth and the attending circumstances. 
By and large these investigations have been 
concerned with the construction, measurement, 
and/or evaluation of single factors contribut- 
ing to scholastic change. Few reports are con- 
cerned with the interplay of several factors. 


To be able to predict pupil growth more ade- 
quately it would seem that there is a need for 
multiple factor analysis and the weighting of 
numerous combined factors. In addition, there 
must be means of measuring factors affecting 
this growth either through instruments already 
refined or more valid and reliable instruments 
set up. It is necessary, further, to be able to 
measure adequately the outcomes of growth 
and to determine what growth or changes are 
essential to scholastic success. 


1.2 Statement of the Problem 





The investigation herein reported is a study 
of certain factors thought to contribute to the 
social and scholastic growth of pupils. The 
factors singled out for special consideration 
are intelligence, past achievement as measured 
by pre-test scores on measures of social ad- 
justment, personality, eonduct, attitude toward 
the teacher, type of school, and teaching effic- 
fency. 





More specifically, this study will attempt to 
answer the following questions: 


1. Are there significant pupil changes in 
achievement, conduct, attitude toward the 
teacher, social adjustment, and personality 
taking place over a three-month period of 
time ? 


2. Do factors such as citizenship, intelli- 
gence, conduct, teacher attitudes, social adjust- 
ment, and personality contribute to scholastic 
success as measured by gain in achievement? 


3. Can gains be attributed to the teacher? 


4. Can scholastic success be predicted thro- 
ugh a single factor or combination of factors? 


5. Can better prediction of academic success 
be made for rural pupils than for a combined 
rural-urban group? 


6. Are there characteristic differences in 
attitudes, adjustment, and personality between 
rural and urban pupils? 


7. Are factors such as age, teaching exper- 
fence, pre-service education, social adjustment, 
and intelligence associated individually or in 
combination with teaching efficiency? 


8. Can valid predictions of teaching effic- 
iency be made on the basis of the above factors? 


9. Are measures of teaching success, i.e., 
ratings using the M-Blank and a criterion as 
measured by residual pupil gain, related? 
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SECTION 1 





The Procedure of Investigation 
11.1 Design of the Study 


An attempt is made in this investigation to de- 
termine whether any significant changes took 
place over a three month period in social adjust- 
ment, attitudes, information, and personality of 
seventh and eighth grade pupils in rural, state 
graded, and city schools. Various data were 
collected in 20 school units composed of 22 
teachers in 12 rural, six state graded, and two 
city schools located in Southeastern Wisconsin. 
The county Superintendent of Schools cooper- 
ated with the investigator in the selection of the 
school units and teachers, used in this study. 




































































The school units were broken down into three 
divisions which he defined as, (1) rural some- 
what isolated, (2) rural not isolated, and (3) 
state graded with the first division being com- 
posed generally of schools located on other 
than main highways. Since rural schools in 
Wisconsin often have a limited enrollment, care 
was taken to select schools having at least a 
total of six seventh-and eighth-grade pupils. 


Each teacher selected for the research pro- 
ject was notified by the county Superintendent 
of Schools, and if any objection arose provision 
was made to make a substitution. In his sub- 
jective opinion, each division represented two 
good, two fair, and two poor teachers. 










































































Upon the suggestion of the writer, the superin- 
tendent later filled out the Wisconsin Adaptation 
M-Blank rating (Appendix A.1)* for each of the 
rural teachers. This represented a more objec- 
tive teacher evaluation than he used in his 
first divisional rating. The superintendents 
of the two city school systems were requested 
to fill out the Wisconsin Adaptation M-Blank 
for the two teachers in his respective seventh 
and eighth grades. 

































































A week in advance of the testing program, 
the writer and a rural supervising teacher vis- 
ited all of the teachers in the 20 school units 
which were to be incorporated in the study. 
The purpose of the visit was to answer any 
questions and to attempt to have the teachers 
feel that they could make a contribution to ed- 
ucation by encouraging their pupils to be co- 
operative in the testing program. 



































The testing program was carried out during 
the second semester of the 1944-45 school 
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year. The investigator and his wife adminis- 
tered the tests. Pre- and post-tests were 
given to the students with the schools divided 
so that the same person gave the pre- and 
post-test for a particular school. This method 
was employed to help control some of the dif- 
ficulties that often arise, such as, giving un- 
satisfactory directions; repeating of the test 
by another administrator thus giving unsatis- 
factory results since pupils may not be at 
ease; lack of interest on the part of adminis- 
trators; etc. 


It was desirable that no student in any school 
be given a time advantage over any other stu- 
dent in that school. Therefore, the time be- 
tween all pre-tests and post-tests was exactly 
twelve school weeks to the hour of beginning 
administration of the test. That is to say, ifa 
pre-test in a certain field was given at 9 
o’clock, the post-test was given at 9 o’clock 
twelve school weeks later. 


The 22 teachers were asked to fill out a 
sheet known as the Jones’ Observational Scale 
for Rating Citizenship at School (Appendix A.2) 
for each seventh and eighth grade pupil. A 
second sheet (Appendix A.3) was also given to 
the teachers to check any of the 42 items per- 
taining to Home Environment. The informa- 
tion obtained on both of these sheets was very 
inadequate, and therefore, after careful study 
it was decided not to include the results in this 
investigation. 


A study relevant to relationship of home en- 
vironment and personality adjustment, however 
was carried out by Stott.4 From the investi- 
gation involving 1,217 children in 14 village 
schools and 46 one-room country schools, he 
concluded that ‘‘.......in general, the way a 
child feels about himself in relation to the 
world about him is positively associated with 
the sort of relationships which exist between 
himself and his family. There appears to be 
something about living on a farm near enough 
to the village or community center and about 
attending the village school which is relatively 
favorable to his psychological well being.’’ 


11.2 Description of Measures Used 





After a review of instruments designed to 
measure factors of intelligence, social adjust- 
ment, attitudes, personality, achievement, and 
teaching ability, certain tests were selected on 
the basis of applicability to the study under 
consideration. 
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* References to Appendices will be found 
of Wisconsin Library. 








in original thesis on file, University 





1. Leland H. Stott, “Some Environmental Factores in Relation to the Personality 
Adjustment of Rurel Children," Rural Sociology, X (December 1945). 
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The Wisconsin Adaptation M-Blank as repor- 
ted previously was used to rate the teachers. 
The superintendent of each of the city schools 
rated the seventh and eighth grade teacher in 
his respective school. Thus a total of 22 
teachers were rated. 


Measuring instruments used in the study with 
the frequency of administration appear in 
Table I. It is noted that wherever possible the 
same tests were given to students and teachers. 


The Cooperative Social Studies Tests (Appen- 
dix A.4), devised by Agatha Townsend and Mary 
Willis, is divided into three parts: I. Fact, 
Skills, and Applications; Il. Terms and Con- 
cepts; and Ill. Comprehension and Interpreta- 
tion. Scaled scores are provided to enable the 
administrator to gain directly interpretative 
information. In using a scaled score a number 
of 50 corresponds to the expected perform- 
ance of an ‘‘average’’ individual with three 
years of study of the subject in grades 7, 8, 
and 9, tested at the end of the 9th grade. A 
table of tentative norms based on approximate- 
ly 5,000 junior high school students was 
recently published. No report has been made of 
validity or reliability. 


The primary purpose of the Washburne Social- 
Adjustment Inventory, Thaspic Edition (Appen- 
dix A.5), is to determine the degree of social 
and emotional adjustment of each pupil. The 
test is arranged so as to give a separate meas- 
ure of development for six traits which are 
slightly correlated with intelligence, and more 
highly correlated with social and emotional 
adjustment. Three of the traits pertain to emo- 
tional adjustments with other people and en- 
vironment, and the remaining three to self- 
organization and self-regulation. 


The Social-Adjustment Inventory Test has 
been used and revised over a period of 10 
years. The test, for purposes of diagnosis, is 
divided into eight subtests, one essay, six ob- 
jective measuring social and emotional adjust- 
ment, and one of truthfulness or test accuracy. 
These are designated by (t) Truthfulness; 

(h) Happiness; (a) Alienation; (s) Sympathy; 

(p) Purpose; (i) Impulse-Judgment; (c) Control; 
and (w) Wishes. The author points out that the 
initials of the subtests spell the word ‘‘Thaspic’’ 
omitting the subtest, Wishes. 


The Social-Adjustment Inventory is not only 
said to be a valid instrument, but also a reli- 
able one. The biserial coefficient of validity 
is reported to be .90. The coefficient of re- 





RIESCH 


liability as determined by a retest of approxi- 
mately 400 college students after an interval 
of one semester was .92. The probable error 
for an individual’s total social-adjustment 
score was 5. For the sub-tests, except for 
the truthfulness score, the approximate range 
of the coefficients of reliability was from .80 
to .90. This information would seem to indi- 
cate that the inventory was sufficiently reli- 
able. 


In a number of cases the pupils’ scores ex- 
ceeded a truthfulness score of 30. The exam- 
iner felt that these pupils could be left in the 
study as these scores would indicate a very 
high degree of social maladjustment. Wash- 
burne contends that these tests should be dis- 
carded because of the low level of truthfulness. 


The Right Conduct Test (Appendix A.6) was 
devised by Harlow A. Wood, Superintendent of 
Schools, Munising, Michigan. The test is 
composed of two parts of which Part I consists 
of 40 short descriptive situations pertaining to 
questions of social conduct. The pupil is told 
to select one of the three answers listed under 
each of the 40 items. Part II is made up of 60 
true-false statements relating to questions of 
social behavior, in which the pupil is told to 
draw a circle around either true or false. 


The purpose of the test is to measure the 
pupil’s social information. At present, coef- 
ficients of reliability and validity are not 
available for the test. Wood has used the 
test in numerous schools. The results indi- 
cate that the instrument has value in deter- 
mining whether pupils have knowledge of 
‘‘right’’ conduct. 


Remmer’s Scale for Measuring Attitude 
Toward Any Teacher (Appendix A.7) has been 
used in several studies with satisfactory re- 
sults. Tschechtelin, Frances, and Remmers2 
found the scale of value in their study carried 
out in 1940. The scale consists of 45 state- 
ments about the teacher. Pupils are directed 
to place a plus sign in front of each statement 
which is true about the teacher, and are assured 
that their scores will in no way affect their 
grades in any course. 


Scale values are assigned to each of the 45 
statements. The final score is merely the 
scale value of the median item among those 
that were checked. The measure begins with 
a positive statement, ‘‘is perfect in every way,’’ 
and ends with negative statements such as the 
last, ‘‘is a failure as a classroom teacher.’’ A 
scale so constructed is an attempt to supply 





on 


2. M. & Techechtelin, Md. T. Frances, end 


of School Children Toward Their Teechers," 


ogy, XXXI (March 1940), ov. 195-203. 


H. H. Remmers, “Measuring Attitudes 
Journal of Educational Psychol- 
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TABLE I 


TEACHER AND PUPIL TESTS ADMINISTERED WITH FRE- 
QUENCY OF ADMINISTRATION 





Pupil Tests 


Times 
Given 


Teacher Tests 





Townsend and Willis 
Cooperative Social 
Studies Test 


Townsend and Willis 
Cooperative Social 
Studies Test 





Washburne Social Ad- 
justment Inventory 


Washburne Social Ad- 
justment Inventory 





Wood Right Conduct 
Test 


Wood Right Conduct 
Test 





Remmers Scale for 


Remmers Scale for 


Measuring Attitude 
Toward Any Teacher 





Sullivan, Clark, and 
Tieg California Test 
of Mental Maturity 


Sullivan, Clark, and 
Tieg California Test 
of Mental Maturity 





Clark, Tieg, and Thorpe 
California Test of Per- 
sonality 





Clark, Tieg, and Thorpe 
California Test of Per- 
sonality 
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data on the relationship between pupils’ atti- 
tudes toward their teacher. 


The California Test of Mental Maturity, inter- 
mediate series (Appendix A.8), and advanced 
series (Appendix A.9), were administered to 
the pupils and teachers respectively to deter- 
mine their intellectual capacity and mental 
maturity. The test has several significant 
features such as, (1) a diagnostic evaluation of 
those mental abilities that are related to suc- 
cess in various types of school activity, (2) a 
diagnostic profile designed to point out partic- 
ular abilities, (3) compositie experience of ed- 
ucational leaders in the mental measurement 
field, and (4) three mental age and intelligence 
quotients which are, a non-language M.A. and 
1.Q., a language M.A. and I.Q., and the M.A. 
and 1.Q., characteristic of familiar intelligence. 


The battery contains such tests as visual acu- 
ity, auditory acuity, and motor coordination. 
The reason for this organization is to detect 
pupils with serious defects which might prevent 
the obtaining of valid diagnosis of their mental 
maturity. Each of the batteries is built upon 
the idea of taking into account recognized ‘mental 
processes or functions. 


The various sections of the test show a high 
reliability. Data was gathered using 400 stu- 
dents in grades 7, 8, and 9, and 100 adults. 

An analysis was made to determine reliability 
coefficients computed by the split-halves meth- 
od and corrected by the Spearman-Brown form- 
ula. The results are shown in Table II. 


‘*The probable error of estimate for an indiv- 
idual score varies from three months of mental 
age when the reliabilities for single grade are 
-94 and above to five months when the reliabil- 
ities are .87 to .89. The probable errors of 
estimate of individual I.Q.’s for similar relia- 
bilities are 2 and 4 points, respectively.’’ 


The California Test of Personality (Appendix 
A.10) was used to measure the pupils’ and 
teachers’ ability to meet social and personal 
problems in every day Mving. The major pur- 
pose of the test is to discover to what extent 
the individual is adjusting himself to conditions 
with which he is confronted. A secondary pur- 
pose is to measure whether the individual is 
developing a normal, happy, and socially effec- 
tive personality. The test is divided into two 
parts. Section I deals with self adjustment 
based upon feelings of personal security, such 
as, self-reliance, sense of personal worth, and 
sense of personal freedom from nervous symp- 
toms. 
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Part II deals with social adjustment based 
upon feelings of social security such as social 
standards, social skills, freedom from anti- 
social tendencies, family relations, and com~- 
munity relations. The two sections aid in de- 
termining whether or not the student’s basic 
needs are being satisfied and whether or not 
he is developing a sense of self-realization 
and social acceptance. The items included 
in the personality test were selected on the 
basis of: judgments of both teachers and prin- 
cipals as to their relative validity and signif- 
icance, the reactions of students, the extent to 
which student responses and teacher appraisals 
agreed, and a study of items using the bi-serial 
r technique. 


The reliability of the California Test of Per- 
sonality between Section I and Section II is .74. 
This would seem to indicate that it is desirable 
to study the student from the standpoint of self 
and social adjustment since the predictive value 
is only about 53 percent. Correlations were ob- 
tained for 792 cases by the split-halves method 
corrected by the Spearman-Brown formula. 

An r of .932 and a probable error of estimated 
score of 3.7 was found for Section 1, Self Ad- 
justment. Section II, Social Adjustment, yielded 
an r of .873, and a probable error of estimated 
score of 2.6. For total adjustment the r was 
-932, and the probable error of estimated score, 
3.7. 


11.3 Procedure of Scoring Tests 





All of the tests used in this study were cor- 
rected by the writer. Each test was checked 
twice in order to eliminate possible errors 
that may have occurred in correcting them. 
Several of the tests could have been machine 
scored, but the writer felt that a better under- 
standing of the problem and data might be pos- 
sible through hand scoring each particular test. 


In scoring the Cooperative Social Studies Test, 
the essential information needed is printed on 
scoring keys. Space is provided at the end of 
each test for entering the necessary figures for 
computing all raw scores. The number of right 
answers are entered in the appropriate space, 
and the amount to be subtracted depends upon 
the numper of wrong answers as shown in the 
chart. All fractions of one-half or less are 
dropped and fractions greater than one-half 
increase the amount to be subtracted to the 
next higher integer. If the resulting difference 
becomes a negative, the raw score is zero. Up- 
on completing the raw scores, the scaled scores 
were computed by referring to the conversion 
tables which are provided. 





3. Ibid., p. 2 
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The Washburne Social-Adjustment Inventory 
provides scoring stencils or keys containing a 
pattern of holes which reveals the incorrect 
answers of either two or three of the subtests. 
Each hole in the stencil has a letter designating 
the subtest to which the item belongs and a 
number showing the weight to be given to the 
item if a black mark appears. The score of the 
‘‘Wishes’’ is not necessary, since norms are 
based on the total score made on the objective- 
type subtests. For this reason the ‘‘Wishes’’ 
were not used in calculating the final score in 
this study. 


The Right Conduct Test consisted of two parts 
as previously stated. Part I has a total of 40 
items, and Part Il a total of 60 true-false state- 
ments. The final score is obtained by subtrac- 
ting the number wrong from 100. 


Remmers’ Scale for Measuring the Attitude 
Toward Any Teacher has 45 statements about 
the teacher. Remmers assigned scale values 
which are added for those items checked, and 
the final score is merely the median. 


The California Test of Mental Maturity has a 
key to aid in scoring the test. Each item is 
considered either right or wrong. Each correct 
item is marked with a C and if two or more an- 
swers are given the answer is considered 
wrong unless the pupil has attempted to cross 
out or erase one of the items. The number 
right is then recorded on the bottom of each 
page. and transferred later to the first page of 
the test. 


An answer key is used in scoring the Cali- 
fornia Test of Personality. There is a total 
of fifteen items in each subsection and the 
score for each column is the number of student 
responses which conform with the answer key. 
Each correct answer is marked witha C. If 
both YES and NO are marked the answer is con- 
sidered wrong unless the pupil has attempted to 
erase or change one of the YES or NO answers. 
The person doing the scoring must be careful 
to use the correct column of the answer key for 
each test column. One_adds self adjustment and 
social adjustment scores to obtain the total ad- 
justment score of each pupil. 


II.4 Description of Pupils and Teachers 





This investigation began with a total of 301 
seventh- and eighth-grade pupils in 12 rural, 
six state graded, and two city school systems. 
The number of pupils under consideration was 
reduced to 258 because of incomplete data; 43 
pupils were eliminated because of being unable 
to comply with all requirements set up, for ex- 
ample, some students moved during the three 
month lapse of time between the administration 
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of the pre-test and post-test. Other students, 
who took the pre-test, were absent at the time 
the post-test was given and therefore were not 
included in the study. The total number of 
students in each of the three general school 
divisions and the number who completed all re- 
quirements are presented in Table III. 


Twenty-two teachers are included in this study. 
Their mean age was 35.6 years with the young- 
est teacher being 20 and the oldest 68. They 
ranged in experience from one to 43 years with 
the average being 12.4 while the least amount 
of pre-service training was one year and the 
maximum was four years. Average training 
was 2.3 years. 


SECTION Ii 


Report of Findings and Interpretation of 
Results 





This investigation has as its primary purpose 
the study of various factors which appear to be 
related individually or in combination to pupil 
change measured over a three month period of 
time. The factors under consideration are 
pupil intelligence, past achievement as given 
by pre-test scores on measures of social ad- 
justment, personality, conduct, attitude toward 
the teacher, and teaching efficiency. 


Ill.1 The Measurement of Pupil Gain 





Pupil gain or growth was measured with re- 
spect to achievement, social adjustment, person- 
ality, conduct, and attitude toward the teacher. 
Pre-tests were given to "01 seventh and eighth- 
grade pupils in 12 rural, six state graded, and 
two city school systems. Three months later 
to the hour a repeat of the tests, hereafter re- 
ferred to as post-tests, was made to all of the 
original group available. Because of illness 
and other factors, 43 pupils did not take all of 
the post-tests and were, therefore, eliminated, 
leaving 258. The latter group, then was studied 
thoroughly. 


Each pupil completing the requirements for 
this study was given the following pre-tests 
and post-tests: Townsend and Willis Cooper- 
ative Social Studies Test, the Washburne 
Social Adjustment Inventory, the Wood Right 
Conduct Test, and the Remmers’ Scale for 
Measuring Attitude Toward Any Teacher. In 
addition, the California Test of Mental Matur- 
ity was given. The mean I.Q. as determined 
from the latter test was 100.69. The standard 
deviation was 13.24. The means and standard 
deviations of all other tests are presented in 
Table IV. 
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The mean of the group under investigation 
was 136.6 for the pre-test and 145.3 for the 
post-test on the California Test of Personal- 
ity. These means are equivalent to the thirty- 
fifth and fiftieth percentiles respectively of 
the group on which the test was standardized. 
No.comparative norms are available for the 
right conduct test. The, 258 pupils stood at 
about the forty-eighth pércentile on the pre- 
test and the fifty-second percentile on the 
post-test of the Cooperative Social Studies 
test when compared with the standardization 
population. These two tests indicate that the 
group studied is average in personality and 
the Cooperative Social Studies. 


All mean gains as shown in TableIV were pos- 
itive with the exception of attitude toward the 
teacher indicating that some factor or group of 
factors, operated to bring the teachers into a 
more unfavorable light during the experimental 
period. The range of the mean gains was from 
a positive 8.667 to a -.285. The California 
Personality had a mean gain of 8.667, the Social 
Studies Achievement 6.961, the Social Adjust- 
ment 3.126, the Right Conduct 2.100, and the 
Attitude of the Teacher with a -.285. 


For purposes of this study, the determination 
of significance or lack of significance in the 
differences was measured with respect to the 
means of the pre- and post-tests utilizing the 
following equation: 


o (Mj -M9) = M, - M2 








V oM12 + Mo” - 2rjg 0102 


where Mj is the variance of the post-test, M2 
is the variance of the pre-test, and rj9 is the 
Pearson-Product-Moment Coefficient of Rec- 
tilinear Correlation between the pre-test and 
the post-test. 


The significance of differences between means 
is presented in Table V . It appears that sig- 
nificant positive changes have been made in 
both conduct and achievement since the proba- 
bility is less than one in 100 that a change of 
this degree could be attributed to chance alone. 


For the other factors, the changes were not 
statistically significant. There is some reason 
to believe, under the assumption that the Rem- 
mers’ Scale for Measuring Attitude Toward 
any Teacher is a valid measure, that factors 
operating both in and out of the school situation 
have not contributed positively but rather have 





‘caused a decrease in attitude toward the teacher. 


If the Washburne social adjustment inventory 
alone is considered, there is evidence of pupil 
maladjustment. On both the pre-test and post- 
test, the mean of the group falls in the low 
normal range. Should this test be a true meas- 
ure of maladjustment, there is indication that 
individual pupils of the type tested are not well- 


adjusted since 20 pupils fell in the classification 


of maladjustment on the pre-test, and 26 fell 
within this range on the post-test. 


From this, one gathers the impression that the 


influences surrounding the child were not con- 
ducive to improvement in social adjustment. 
Eleven percent of the rural, 10 percent of the 
state graded, and two percent of the city pupils 
were shown to be maladjusted on the pre-test, 
while 13 percent of the rural, 11 percent of the 
state graded, and six percent of the city were 


shown to be maladjusted on the post-test. There 


appears to be no significant differences in atti- 


tudes, adjustment, and personality between rural 


and urban pupils (Appendix B.6). 


If1.2 Relationship Between Pupil Gain and 
Factors Thought to Condition Gain 








To determine the relationship between the 
various tests, coefficients of correlation were 
found for all pre-tests, post-tests, and gains. 
These with the errors of correlation and the 
significance level of the correlation are pre- 
sented in Tables VII, VIII, andIX. The error of 
correlation was computed using the equation# 


Er, V¥N-1 


This equation was used since it was the desire 
to determine whether it could reasonably be 
expected that an r as large as the r obtained 
would be possible by chance if the true r is 
zero. 


The significance level of the correlation was 
determined using the following equation:5 Per- 
cent confidence limit 1-2 (area under a normal 
probability curve) at 


z 
1 


\N-1 


Referring to Tables VII and XI , one finds 
jthat the following pre-tests are significantly 
interrelated at less than the one percent level, 


(Vol. XVIII) 





4. Charles 0. Peters and Walter R. Van Voorhis, Statistical ce 6 and Thei 
Mathematical Basis (New York: McGraw Hill, 1940), p. 153. 

5. L. Joseph Line, "The Prediction of Teaching Efficiency," Journal of Experi- 
mental Education, XV (September 1946), p. 43. 
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TABLE IV 


MEANS AND STANDARD DEVIATIONS OF MEASURES 
APPLIED TO PUPILS 





Measures 





Pre-Tests 


California Personality 
Right Conduct 

Social Studies Achievement 
Social Adjustment 

Attitude of Teacher 


Post-Tests 


California Personality 
Right Conduct 

Social Studies Achievement 
Social Adjustment 

Attitude of Teacher 


Gains 


California Personality 
Right Conduct 

Social Studies Achievement 
Social Adjustment 

Attitude of Teacher 














TABLE V 


SIGNIFICANCE OF DIFFERENCES BETWEEN MEANS OF PRE- 
TESTS AND POST-TESTS FOR VARIOUS PUPIL FACTORS 





Standard Error 
Pupil Factor of Difference 
Attitude Toward Teacher -1.95 
California Test of Personality 1.68 
Wood Right Conduct 4.38 
Cooperative Social Studies 9.94 
Washburne Social Adjustment 1.36 
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TABLE VI 


CORRELATIONS BETWEEN GAIN ON PUPIL TESTS AND TEACHING 
EFFICIENCY (AS MEASURED BY SUPERINTENDENT'S 
M-BLANK RATING) 





Factors 





Gain Personality 

Gain Right Conduct 
Gain Achievement 
Gain Social Adjustment 
Gain Attitude 





* Significant at one percent level. 


personality and right conduct, r = .71 + .06; 
personality and social adjustment, r = .65 + .06; 
personality and social studies achievement, 

r = .31+ .06; right conduct and social adjust- 
ment, r = .53 + .06; right conduct and social 
studies achievement, r = .71 + .06; and social 
adjustment and social studies achievement, 

r = .43 + .06. It appears, then, that the tests of 
personality, social adjustment, and social stud- 
ies achievement are measuring, to degree, the 
same thing since they are interrelated. How- 
ever, they are not related to attitude of the 
teacher or intelligence as measured here. 


What has been stated of the pre-tests is also 
true of the post-tests with the exception of the 
fact that the post-test attitude toward the 
teacher is significantly related to intelligence, 
r = .16 + .06, at less than the five percent 
level. As indicated by Tables VII and XII 
when compared with Tables VII and XI , there 
has been change of a greater degree in some 
factors than in others since the coefficients of 
correlation, though still significant at less than 
the one percent level, have changed. The co- 
efficients of correlation are for personality 
and right conduct, .54 + .06; personality and 
social adjustment, .71 + .06; personality and 
social studies achievement, .37 + .06; right 
conduct and social adjustment, .27 + .06; right 
conduct and social studies achievement, .73 + 
-06; and social adjustment and social studies 
achievement, .50 + .06. 


Tables VII and XIII show the general lack of 
relationship between the pupil gain scores on 
. personality, attitude, intelligence, right con- 
duct, social adjustment, and social studies 
achievement. The correlation between gain 
scores for personality and social adjustment 
is .24+ .06; for the gains on achievement and 
social adjustment, .18 + .06; between pre-test 
scores on conduct and gain in achievement, 

= .21; and between pre-test scores on person- 
ality and gain in achievement, r = .20. These 





are significant at the one percent level. 


The composite of gain scores, as outlined 
earlier, was developed by finding the mean 
gain on the personality, right conduct, social 
adjustment, and social studies achievement 
tests for each pupil. Again referring to Table 
X , certain significant zero-order coefficients 
of correlations are found. The following tests 
when correlated with the composite are signif- 
icant at the one percent level: California Men- 
tal Maturity, .26 + .06; post-test attitude, .32 + 
.06; and pre-test right conduct, .20+ .06. — 
These coefficients are low and have very little 
predictive value when individual pupils are 
consider ed. 


Assuming that the pupil group under consid- 
eration is representative of rural pupils, one 
might conclude that teachers who are rated 
high by the superintendents when the superin- 
tendents use the Wisconsin Adaptation of the 
M-Blank will conduct their classes in such a 
way that there would be improvement in the 
general attitude of the pupil toward the teacher. 
It also appears from the data previously pre- 
sented that the pupil tests of personality, social 
adjustment, and social studies achievement are 
measuring to degree the same thing. However, 
these tests and the California Mental Maturity 
Test are not related. 


What has been stated concerning the tests of 
personality, social adjustment, social studies 
achievement, and intelligence, is true of the 
pre-tests as well as the post-tests of the three 
first mentioned. The relationship between the 
gains on these three tests and achievement is 
statistically significant but small. These 
factors except as previously noted appear not 
to be related to gains. 


111.3 The Prediction of Pupil Achievement 





Two measures of scholastic success will be 
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TABLE Vi 


COEFFICIENTS OF CORRELATION BETWEEN VARIOUS PUPIL FACTORS 
EXPRESSED AS Z-SCORES (WITH THE STANDARD ERROR AND 
SIGNIFICANCE LEVEL OF EACH CORRELATION 
COEFFICIENT) USING PRE-TEST SCORES 





Ey Significance Level 





Personality and 
Attitude of Teacher 12.64 


Personality and 
California Mental 
Maturity 


Personality and 
Right Conduct 


Personality and 
Social Adjustment 


Personality and 
Social Studies 
Achievement 


Attitude of Teacher 
and California 
Mental Maturity 


Attitude of Teacher 
and Right Conduct 


Attitude of Teacher 
and Social Adjustment 


Attitude of Teacher 
and Social Studies 
Achievement 


California Mental 
Maturity and Right 
Conduct 


California Mental 
Maturity and Social 
Adjustment 


California Mental 
Maturity and Social 
Studies Achievement 


Right Conduct and 
Social Adjustment .00 











Continued.......... 
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TABLE VII (Continued) 





Er Significance Level 





Right Conduct 
and Social Studies 
Achievement 


Social Adjustment 
and Social Studies 
Achievement 














TABLE VII 


COEFFICIENTS OF CORRELATION BETWEEN VARIOUS PUPIL FACTORS 
EXPRESSED AS Z-SCORES (WITH THE STANDARD ERROR AND 
SIGNIFICANCE LEVEL OF EACH CORRELATION 
COEFFICIENT) USING POST-TEST SCHORES 





E; Significance Level 





Personality and 
Attitude of Teacher ' -06 63.12 


Personality and 
California Mental 
Maturity 


Personality and 
Right Conduct 


Personality and 
Social Adjustment 


Personality and 
Social Studies 
Achievement 


Attitude of Teacher 
and California 
Mental Maturity 


Attitude of Teacher 
and Right Conduct 
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considered in this section, namely, (1) social 
achievement as defined by the gain in the co- 
operative social studies achievement, and 

(2) success defined by a composite of gain 
scores for personality, right conduct, social 
adjustment, and achievement. The prediction 
of social studies achievement will be consid- 
ered first. The intercorrelations for gain in 
social studies and pre-test scores in this 
area, conduct, social adjustment, and person- 
ality are given in Table XIV. 


It is found that the following pre-tests are 
positively and significantly related to gain in 
achievement, pre-test conduct, .21 + .06; pre- 
test social adjustment, .18 + .06; and pre-test 
personality, .20+ .06. The three coefficients 
of correlation are found to be significant at 
the one percent level. Other coefficients with 
their standard error and significance level are 
presented in Table VII. The coefficient of 
correlation between the pre-test on achieve- 
ment and the gain achievement is no greater 
than could normally be attributed to chance. 


Should one now use the three factors with 
significant coefficients, the correlation matrix 
as presented in Table xIV results. 


Using the Aitkin Method, a multiple correl- 
ation, Ro.123 = -230, is found with the resulting 
regression equation being xg = .1290x, - .0771x2 
- .0578x3. If only factors X; and X2 are consid- 
ered, the multiple correlation, Ro 320 is .227. 
The regression equation is x = .1587x, - 
.0991xg. Both coefficients of correlation are 
significant at less than the one percent level. 
However, personality does contribute when Xj 
and X2 are used as independent variables. In 
fact, the pre-test in right conduct is as good, 
for all practical purposes, for predicting 
achievement as is the combination of xj and 
XQ OF X4, X9, and x3. 


The question might be asked, since both 
rural and city pupils are considered in this 
study, whether it might not be more practical 
to consider the rural and city pupils separately 
in attempting to discover the relationship be- 
tween the pre-tests and 1.Q. and gain in achiev- 
ment. There were 90 pupils in the city schools 
and 168 in the rural. 


When the city pupils are considered alone, 
the coefficients of correlation of Table XII 
are found. When the total of 258 pupils were 
considered, significant correlations were 
found between gain in achievement and the 
following pre-tests taken separately: achieve- 
ment, conduct, and social adjustment. It is 
noted that when city pupils alone are considered 
there are no significant correlations at even the 
five percent level. 
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However, when one used only the 168 rural 
pupils, omitting the city pupils from the total 
group, correlations significant at the one per- 
cent level result between the gain in achieve- 
ment and the pre-test scores on the Califor- 
nia Personality, Right Conduct, and Social Ad- 
justment tests. The coefficients of correlation 
are .29+ .08, .32 + .08, and .27+ .08. These 
with other coefficients are presented in Table 
XVI. Again, it may be pointed out that appar- 
ently the pre-test in achievement, the attitude 
toward the teacher, and the intelligence of the 
pupils do not condition gain in achievement. 


It is difficult to explain the apparent differ- 
ence in ability to predict achievement gain for 
rural and city pupils. One may account for this, 
possibly, through considering the outside factors 
related to the aforementioned gain. It would 
seem that rural pupils have equal opportunities 
to a greater degree than city pupils because 
of the type of work which they do in out-of- 
school hours. Nearly all of them have ‘‘chores’’ 
to do on the farm while city pupils have varied 
experiences and unequal opportunities for study. 
Some work on paper routes and similar assign- 
ments, while others have a great deal of time 
for home study. 


Many people will be interested in pupil gains 
not merely in subject matter achievement but 
in conduct, social adjustment, and personality 
development. Accordingly, a composite of the 
gains in these several areas was developed 
for further consideration. 


To obtain an overall picture of pupil growth 
and achievement, a composite of gain scores 
for personality, right conduct, social adjust- 
ment, and achievement was made. The pro- 
cedure was to add the z-score gains for each 
pupil, hereafter called the composite gain. 


In an attempt to predict this composite gain 
and ultimately to develop a criterion score of 
teaching effectiveness based upon residual 
pupil gains, zero-order coefficients of cor- 
relation were obtained between the composite 
gain and the pre-test, post-test, and gain on 
the Remmers’ Scale for Measuring Attitude 
Toward any Teacher. The coefficient of cor- 
relation was also found between the composite 
gains and the California Test of Mental Matur- 
ity. The Pearson-P roduct-Moment Coeffic- 
ients of Correlation were as follows: compos- 
ite gain and pre-test attitude, .03; composite 
gain and post-test attitude, .32; composite 
gain and gain attitude, .02; and composite gain 
and California Mental Maturity, .26. 


Only the correlations between the composite 
gain and the post-test attitude and California 
Mental Maturity were significant, both being 
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TABLE IX 


COEFFICIENTS OF CORRELATION BETWEEN VARIOUS PUPIL FACTORS 
EXPRESSED AS Z-SCORES (WITH THE STANDARD ERROR AND 
SIGNIFICANCE LEVEL OF EACH CORRELATION CO- 
EFFICIENT) USING GAIN SCORES 





E, Significance Level 





Personality and 
Attitude of Teacher 06 24.20 


Personality and 
California Mental 
Maturity 


Personality and 
Right Conduct 


Personality and 
Social Adjustment 


Personality and 
Social Studies 
Achievement 


Attitude of Teacher 
and California 
Mental Maturity 


Attitude of Teacher 
and Right Conduct 


Attitude of Teacher 
and Social Adjustment 


Attitude of Teacher 
and Social Studies 
Achievement 


California Mental 
Maturity and Right 
Conduct 











Continued 
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TABLE IX (Continued) 





Er Significance Level 





California Mental 
Maturity and Social 
Adjustment 


California Mental 
Maturity and Social 
Studies Achievement 


Right Conduct and 
Social Adjustment 


Right Conduct and 
Social Studies 
Achievement 


Social Adjustment 
and Social Studies 
Achievement 


Gain Achievement 
and Pre-Test Attitude 


Gain Achievement and 
Post-Test Attitude 


Gain Achievement and 
Pre-Test Conduct 


Gain Achievement and 
Pre-Test Social Adjust- 
ment 


Gain Achievement and 
Pre-Test Personality 


Gain Social Adjustment 
and Attitude of Teachers 


Gain Social Adjustment 
and California Mental 
Maturity 


Gain Social Adjustment 
and Pre-Test Social 
Adjustment 


Gain Social Adjustment 
and Pre-Test Achievement 

















JOURNAL OF EXPERIMENTAL EDUCATION (Vol. XVIII) 


TABLE X 


COEFFICIENTS OF CORRELATION BETWEEN VARIOUS PUPIL FACTORS 
EXPRESSED AS Z-SCORES (WITH THE STANDARD ERROR AND 
SIGNIFICANCE LEVEL OF EACH CORRELATION 
COEFFICIENT) USING COMPOSITE GAIN 
SCORES 








Er Significance Level 





Pre-Test California 
Personality and Com- 
posite Gain 


Pre-Test Right Con- 
duct and Composite 
Gain 


Pre-Test Social Studies 
Achievement and Com- 
posite Gain 


Pre-Test Social Ad- 
justment and Com- 
posite Gain 

signif 
California Mental as the 
Maturity and Composite there 
Gain relatii 
up. 
Pre-Attitude and Com- 


posite Gain m.4 1 
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TABLE XII 


SUMMARY OF INTERCORRELATIONS OF POST-TESTS 
USING Z-SCORES FOR 258 PUPILS 





ia 


x3 





Personality x 
Attitude of 
Teacher x9 
California Men- 
tal Maturity x3 


Right Conduct X4 
Social Adjust- 
ment X5 
Social Studies 
Achievement 








-00 


16 


1.00 


-50 


1.00 





significant at the one percent level. Inasmuch 
as the attitude score is a post-test score and 
there was only one remaining significant cor- 
relation, the regression equation was not set 
up. 


1.4 The Measurement and Prediction of 
Teaching Success 








Of the 22 teachers cooperating in this study, 
twelve were located in rural, six in state graded, 
and four in city schools. Factors applied to 
these teachers were the M-Blank, a rating by 
the superintendents as outlined previously; 
the California Test of Mental Maturity; the 
Washburne Social Adjustment Inventory; and 
the Cooperative Social Studies Test. The means 
and standard deviations for these measures are 
presented in Table XVII . Since a few of the 
teachers did not take all of the tests, the assum- 
ption was made that the most probable value for 
each teacher would be the mean of all teachers 
measured. On the basis of this assumption, the 
mean of each measure was assigned to the 
teacher who had not taken the particular test. 


The superintendents’ mean rating, namely, 
3.8, of these teachers, was slightly above the 
theoretical average, 3.0, on the Wisconsin 
Adaptation of the M-Blank, Cooperative Study 
of the Secondary School Standards. The scale 
used was five, outstanding; four, above aver- 
age; three, average; two, below average; and 
one, poor. The teaching group stands on the 
average at the fortieth percentile in compar- 
ison with the college freshman I.Q., standard 
as measured by the California Mental Matur- 
ity Test. However, when compared with 





college graduates, they rank at only about the 
eighth percentile. 


The mean for the teachers on the Washburne 
Social Adjustment Inventory is at the fiftieth 
percentile of college students. Eight of the 
teachers fell in the low normal range while 
only two teachers fell in the superior range. 
The amount of pre-service education of the 
superior range varied from one to four years 
with a mean of 2.3. The range of teaching ex- 
perience and teacher age is 1 - 43, and 20 - 
68 years, respectively, with mean experience 
of 12.4 and mean age of 35.6. 


The Pearson-Product-Moment Coefficients 
of Rectilinear Correlation as computed for 
the teacher factors are presented in Table 
XVIII. Factors considered were the criterion 
of teaching efficiency as developed in Section 
III.1, the Wisconsin Adaptation of the M-Blank, 
the California Mental Maturity Test, the Co- 
operative Social Studies Test, the Social Ad- 
justment Inventory, age of the teacher, exper- 
fence of the teacher, and pre-service education 
of the teacher. 


Associating Tables XVIII and XIX, one notes no 
significant coefficients of correlation between 
superintendents’ ratings, M-Blank; and the 
California Mental Maturity; Social Studies 
Achievement, and Social Adjustment tests. 
Should the M-Blank ratings be valid measures 
of teaching efficiency, then it might be con- 
cluded that, considering all teachers, the 
measures used have no apparent value in deter- 
mining whether or not a teacher is a good 
teacher. 
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TABLE Xi 


SUMMARY OF INTERCORRELATIONS OF GAINS 


USING Z-SCORES FOR 258 PUPILS 











Personality 


Attitude of 
Teacher 


California Men- 
tal Maturity 


Right Conduct 


Social Adjust- 
ment 


Social Studies 
Achievement 











TABLE XIV 


INTERCORRELATIONS OF GAIN ACHIEVEMENT, PRE-TEST CONDUCT, PRE-TEST 
SOCIAL ADJUSTMENT, AND PRE-TEST PERSONALITY . 





Factors 


x2 





Gain Achievement 
Pre-Test Conduct 


Pre-Test Social 
Adjustment 


Pre-Test Person- 
ality 








-18 


53 








(Septer 


Since 
a more 
teache 
that pe 
by the 
the 18 
efficie 
Table | 
relatio 
ment a 
higher 
sidere 
be attr 
efficie 


What 
tionshi 
teache 
lations 
efficie 
teache 
found t 
criteri 
one ca 
upon r 
Menta! 
or Soc 
ation v 
teache 
or yea 
group 
study : 


The | 
pupil g 
amoun 
13+. 
when r 
of the 
staten 
visory 
to gre: 
chance 
of the 
these. 
of teac 


One | 
residu 
the ba 
worth 
super 
a supe 
well i 
ently 1 
Xxtill, 
ships | 
the M- 
specti 
ing the 
schoo! 


__ 


6. It 


(September 1949) 


Since the entire group of 22 teachers may be 
a more heterogeneous group than the 18 rural 
teachers when considered alone, it was felt 
that perhaps teaching efficiency as measured 
by the M-Blank could be predicted better for 
the 18 teachers than the total of 22. The co- 
efficients of correla tion are presented in 
Table XX . Though the coefficients of cor- 
relation between the M-Blank and the Achieve- 
ment and Social Adjustment tests appear to be 
higher when only the 18 teachers are con- 
sidered than with the 22, this difference could 
be attributed to chance since none of the co- 
efficients are significantly different than zero. 


What has been stated concerning the rela- 
tionship between the M-Blank and the various 
teacher factors can also be said of the re- 
lationship between the criterion of teaching 
efficiency, residual pupil gain, and the various 
teacher factors. Referring to Table XIX, it is 
found that none of the coefficients using this 
criterion are significant. This indicated that 
one cannot predict teaching efficiency based 
upon residual pupil gain through the California 
Mental Maturity, Social Studies Achievement, 
or Social Adjustment test. There is no associ- 
ation whatever between this criterion and 
teachers’ ages, years of teachers’ experience, 
or years of pre-service education when a 
group of teachers such as those used in this 
study are considered. 


The M- Blank and the criterion of residual 
pupil gain are not related by more than an 
amount that could be attributed to chance, 

13 + .22. The finding is substantiated by Lins® 
when referring to the Wisconsin Adaptation 

of the M-Blank and residual pupil gain in his 
statement that ‘‘ a composite of......super- 
visory ratings and pupil gain are not related 

to greater degree than can be attributed to 
chance. Whatever, then, is measured by each 
of the criteria, at least it appears evident that 
these criteria do not measure the same aspects 
of teaching efficiency.’’ 


One cannot predict either the criterion of 
residual pupil gain or the M-Blank rating on 
the basis of the teacher tests used. However, 
worth investigating is the possibility that 
supervisors will rate teachers well if they have 
a superior group of students who ‘‘show up’’ 
well in the presence of the supervisor. Appar- 
ently this is not the case, as shown by Table 
XXIII, for there are no significant relation- 
ships between the superintendent’s rating on 
the M-Blank and the student pre-tests. The re- 
spective pre-test factor was developed by find- 
ing the mean of all pupils in each particular 
schoo! and associating that mean with the M- 
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Blank rating of the teacher in that school. 


When, however, the relationship is found be- 
tween the mean pupil factors and the criterion 
of residual pupil gain, one notes that the pre- 
tests of social adjustment and attitude and the 
1.Q. as measured by the California Mental Ma- 
turity test are significantly associated with 
this criterion. The coefficients of correlation 
are presented in Table XXTI . 


Considering the correlation between pupil 
1.Q. and the ‘criterion of residual pupil gain, 
one might conclude that the teachers are spend- 
ing time with the pupils with low 1.Q.’s in an 
attempt to bring them up to the rest of the 
class in their work. Interpreted differently, 
since the criterion is based upon gains, it is 
possible that students who are doing poorly at 
the beginning of the year will make greater 
improvement than pupils who rank high at the 
beginning of the period. 


It would seem that if the pupil gains in atti- 
tude, personality, right conduct, achievement, 
and social adjustment could be attributed to 
the teacher, then there should be a significant 
correlation or relationship between each re- 
spective test gain and the criterion of residual 
pupil gain. To arrive at this determination, 
the mean gain for each school on each factor 
mentioned was found and this was correlated 
with the residual gain criterion. The resulting 
coefficients of correlation with their standard 
errors and significance level are presented in 
Table XXIII. 


The pupil factors do not appear to be signif- 
icantly related to the criterion at the one per - 
cent level. Gain in achievement is significantly 
related to teaching efficiency at the five percent 
level, which indicated that pupils of teachers 
with a high residual criterion will probably 
achieve better than those working with a teacher 
with a low criterion score. Positive but not 
statistically significant correlations were found 
between right conduct and attitude and residual 
pupil gain. 


As outlined previously, the superintendents 
rated the teachers upon the basis of the M-Blank. 
Their subjective estimate correlated with gain 
in student attitudes toward the teacher as evi- 
denced by a coefficient of correlation of .81 + 
.22. If then the superintendent rates a teacher 
high on the M-Blank, there is a high degree of 
possibility that the children will like that teach- 
er. 


111.5 Summary and Conclusions 
The central purpose of this section was to 








6, Ibid.,-p. 59. 
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TABLE XV 


CORRELATIONS OF GAIN IN ACHIEVEMENT FOR CITY SCHOOL 
PUPILS WITH PRE-TESTS SCORES AND I. Q. 





Factors E, 





California Personality d ll 
Right Conduct ll 
Pre-Test Achievement 
Pre-Test Social Adjustment ell 
Pre-Test Attitude 
California Mental Maturity d ll 














TABLE XVI 


CORRE LATIONS OF GAIN ACHIEVEMENT FOR RURAL SCHOOL 
PUPILS WITH PRE-TEST SCORES AND I. Q. 





Factors 





California Personality 
Right Conduct 

Pre-Test Achievement 
Pre-Test Social Adjustment 
Pre-Test Attitude 
California Mental Maturity 


*Significant at the one percent level. 














TABLE XVII 


MEANS AND STANDARD DEVIATIONS OF 
MEASURES APPLIED TO TEACHERS 





Measures M o 





M-Blank 87 
California Mental Maturity 13.89 
Social Studies Achieveme nt 18.08 
Social Adjustment 28.24 
Teacher’s Age 20 - 68 Years* 
Teaching Experience 1 - 43 Years* 
Years of Training 1- 4Years* 


*Range 
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TABLE XVIII 


COEFFICIENTS OF CORRELATION BETWEEN VARIOUS TEACHER FACTORS 
WITH THE STANDARD ERROR AND SIGNIFICANCE LEVEL OF 
EACH CORRELATION COEFFICIENT 





Significance Level 





Criterion and M-Blank d 54.84 


Criterion and California 
Mental Maturity 12.12 


Criterion and Social 
Studies Achievement 


Criterion and Social 
Adjustment 


Criterion and Teacher’s 
Age 


Criterion and Experience 
of Teacher 


Criterion and Years of 
Training 


M-Blank and California 
Mental Maturity 


M-Blank and Social 
3tudies Achievement 


M-Blank and Social 
Adjustment 


M-Blank and Teacher’s 
Age 


M-Blank and Experience 
of Teacher 


M-Blank and Years 
of Training 


California Mental Maturity 
and Social Studies Achieve- 
ment 


California Mental Maturity 
and Social Adjustment 


Social Studies Achievement 
and Social Adjustment 
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TABLE Xx! 


CORRELATIONS BETWEEN M-BLANK RATINGS AND PUPIL 
TESTS (MEAN OF EACH SCHOOL), TOTAL OF 22 SCHOOLS 
CONSIDE RED 





Factors 





Pre-Test Personality 
Pre-Test Right Conduct 
Pre-Test Achievement 
Pre-Test Social Adjustment 
Pre-Test Attitude 
California Mental Maturity 
I. Q. 














TABLE XxXiIl 


CORRELATIONS BETWEEN CRITERION OF RESIDUAL PUPIL 
GAIN AND PUPIL TESTS (MEAN OF EACH SCHOOL), TOTAL 
OF 22 SCHOOLS CONSIDERED 








Factors r 





Pre-Test Personality 21 
Pre-Test Right Conduct -.01 
Pre-Test Achievement -.23 
Pre-Test Social Adjustment -.44* 
Pre-Test Attitude 48 
California Mental Maturity 
I. Q. ~ .48* 














*Significant at five percent level. 


TABLE XxiIil 


CORRELATIONS OF GAIN ON PUPIL TESTS (MEAN OF EACH 
SCHOOL) WITH CRITERION OF RESIDUAL PUPIL GAIN 





Factors E;, 





Gain Personality 

Gain Right Conduct 
Gain Achievement 
Gain Social Adjustment 
Gain Attitude 














*Significant at five percent level. 
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study various factors which might be related in- 
dividually, or in combination, to pupil change 
measured over a three month period of time. 
The factors under consideration were 1.Q. as 
determined by the California Mental Maturity 
Test; pre-test scores on the Cooperative Social 
Studies Test, Washburne Social Adjustment In- 
ventory, Wood Right Conduct Test, Remmers 
Scale for Measuring Attitude Toward Any 
Teacher, the California Test of Personality, 
and teaching efficiency. 


On the basis of this investigation, certain 
generalizations and conclusions seem warran- 
ted: 


With respect to the pupils 





1. Certain seventh and eighth grade pupils 
in rural, state graded, and city schools are 
quite maladjusted. It was found that 10 per- 
cent of the state graded, two percent of the 
city, and 11 percent of the rural pupils were 
maladjusted at the time of the pre-test as 
here measured. Factors operating over the 
three-month period tended to increase this 
maladjustment so that at the time of the post- 
tests. 13 percent of the rural, 11 percent of 
the state graded, and six percent of the city 
pupils were maladjusted. 


2. Significant positive changes in conduct 
and achievement took place over the three- 
month period. However, factors operating 
over this period were either not related to or 
did not condition significant change in person- 
ality, adjustment, or attitude toward the 
teacher. 


3. Gain in achievement is positively and 
significantly related at the one percent level 
to the pre-tests of conduct .21+ .06; social 
adjustment, .18 + .06; and personality, .20 + 
-06. For all practical purposes, the pre-test 
of right conduct is as good a measure for 
predicting gain in achievement as is a com- 
posite of the right conduct, social adjustment, 
and personality pre-tests when weighted in 
that order: Ro 123 = -230, Ro.12 = .227. The 
factors of home environment and citizenship 
were eliminated from the study due to insuffic- 
ient data. 


4. Teaching efficiency measured in terms of 
residual pupil gain is significantly related (at 
the five percent level to pupil achievement). 


5. There appears to be no significant differ- 
ences in attitudes, adjustment, and personality 
between rural and urban pupils. 
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| 6. The tests of personality, social adjustment, 
and social studies achievement are measuring, 
to degree, the same thing since they are sig- 
nificantly interrelated. However, gains of the 
various factors are not associated by a greater 
amount than can be accounted for by chance, 
with the exception of personality and social 
adjustment (r= .24 + .06 - one percent signifi- 
cance level). 


7. When the z-score gains of the person- 
ality, right conduct, social adjustment, and 
cooperative social studies tests are compos- 
ited by finding the mean gain, the following 
are related at the one percent significance 
level: California Mental Maturity, .26 + .06; 
post-test attitude, .32 + .06; and pre-test 
right conduct, .20 + .16. These coefficients 
are low and have little individual predictive 
value. 


8. On the basis of the tests used, it is im- 
possible to predict gain in achievement with 
any acceptable degree of accuracy. 


With respect to the teacher 





1. There is a wide range in age, teaching 
experience, and pre-service education of 
teachers, 20 - 68, and 1 - 4 years, respec- 
tively. 


2. The tests. Washburne Social Adjustment 
Inventory, Wood Right Conduct Tests, and 
California Test of Mental Maturity, adminis- 
tered to the teachers have no apparent value 
in predicting whether a person will be a good 
or poor teacher in rural or city seventh or 
eighth grades. These measures are not sig- 
nificantly related to either the superintendents’ 
general evaluation on the M-Blank or the cri- 
terion of residual pupil gain. The same is 
true of the factors of age, teaching experience, 
and pre-service education. 


3. Whatever is being measured by the M- 
Blank and the criterion of residual pupil gain, 
they are not measuring the same thing or, 
under the assumption that they are measures 
of teaching efficiency, they measure different 
aspects of efficiency (r = .13 + .21). 


4. Teachers concentrate upon the maladjusted, 
low intelligence pupil who enjoys working with 
the teacher. 


5. On the basis of the measures here used it 
was impossible to predict teaching efficiency 
with an acceptable degree of accuracy. 
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A STUDY OF THE 
TEACHER 


DISTRIBUTION OF 


APPROVAL AND DISAP 


PROVAL AMONG SIXTH-GRADE 
PUPILS 


ALBERT F. deGROAT and GEORGE G. THOMPSON* 
Syracuse University 


The purpose of this investigation was to de- 
velop a scale to measure the social-incentive 
‘“‘climates’’ which children experience in their 
day-to-day interactions with teachers. As a 
second phase of the study an attempt was made 
to determine some of the psychological charac- 
teristics of children living under more or less 
clear-cut patterns of teacher approval and dis- 
approval. It was hoped that this research under- 
taking might be profitable in the following ways: 


1. To develop one type of evaluation instru- 
ment for estimating the relative amounts 
of teacher approval and disapproval ex- 
perienced by different pupils. 


. To isolate, in a preliminary way, some of 
the psychological characteristics of child- 
ren who are believed by their classmates 
to be living under different patterns of 
teacher approval-disapproval. 


No attempt has been made in this study to 
infer any cause-effect relationship that may ex- 
ist between patterns of teacher approval-disa- 
pproval and the psychological growth of child- 
ren. The results of the present inquiry leave 
unanswered the question as to whether children 
have developed certain patterns of behavior 
partially as a result of teacher approval-dis- 
approval or whether teachers show different 
evaluative attitudes toward children having 
particular psychological characteristics. This 
interesting problem has been left for subse- 
quent investigation in an experiment involving 
a longitudinal approach of one or more years. 


The present research is based on the assump- 
tion that the nominating procedure, i.e., having 
children nominate their classmates to fill 
certain teacher approval-disapproval descrip- 
tions, provides data that would be positively 
related to patterns of teacher approval-dis: 
approval as determined by an impartial adult 
observer. The study of this latter relation- 
ship has also been left for subsequent investi- 
gation. The writers believe that meaningful 
and worthwhile information is provided by the 
nominating technique, e.g., consider the case 





of the child who is mentioned over 100 times 

as experiencing teacher disapproval and is not 
nominated at all for teacher approval. Certainly 
it would appear that a child experiencing this 
pattern of teacher approval-disapproval would 
respond differently to the classroom situation 
than a child who might be experiencing exactly 
the opposite type of social-incentive conditions. 
The usefulness of these types of data will, of 
course, be more adequately realized through a 
series of investigations, of which the present 
research is merely an exploratory survey. 


Background Literature 





To the writers’ knowledge the nominating pro- 
cedure has not been previously applied to the 
measurement of social-incentive conditions ex- 
perienced by school children. This general 
type of psychological scaling technique has been 
most widely popularized by Moreno (5) and his 
students in the study of inter-personal relation- 
ships. More specifically the particular form of 
nominating procedure used in the present re- 
search was employed in the early Character 
Inquiry experiments conducted by Hartshorne 
and May (2). It was from Hartshorne and May’s 
study that the presently employed evaluation in- 
strument acquired its popularly known title, the 
‘‘Guess Who?’’ technique. This scaling proced- 
ure consists simply of presenting a brief and 
relatively unstructured description of some be- 
havior that could be attributed to any member 
of a group and then asking the various members 
of this group to nominate one or more of their 
associates whose behavior most nearly fits the 
description. This ‘‘Guess Who?’’ approach is 
familiar to most children since it is popularly 
used in school newspapers, annuals, and joke 
books. It has been used in recent years by 
Tryon (9) and Kuhlen and Lee (3) to study the 
personality characteristics of popular and un- 
popular children during the adolescent period. 
The following description is an example of one 
of the twelve approval items employed in the 
present research: 


‘Here is someone who is praised for always 
having his or her assignments done on 





*The writers wish to ecknowledre a research crant-in-sid for this study from 
the School of Education, Syracuse University. 
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time.”’ 


This approach to the measurement of the 
teacher approval-disapproval atmosphere ex- 
perienced by pupils has the advantages of being 
simple to administer and score, of being inter- 
esting to the pupils, and of providing a fairly 
direct evaluation of social-incentive conditions 
in the classroom as children see them. This 
latter advantage is important in that it expressed 
one aspect of the child’s reaction to his compan- 
ions as well as reflecting to some degree teacher} 
approval-disapproval conditions as an impartial 
adult observer might see them. 


The literature devoted to the effects of teach- 
er approval and disapproval on the child’s be- 
havior is extensive but somewhat contradictory. 
The research on the effects of praise and blame 
as incentives to learning has been carefully e- 
valuated by Schmidt who concluded that, ‘‘The 
preceding review of the literature leaves no 
definite conclusions to be drawn as to the effect- 
iveness of praise and blame as incentives to 
learning.’’ (6, p. 7) A more promising approach 
to the effects of approval and disapproval on 
learning efficiency was recently reported by 
Grace (1) in which she attempted to relate per- 
sonality characteristics to children’s responses 
to approval and disapproval in a learning situa- 
tion. Although no clear-cut relationships were 
secured in her study, some promising trends 
were revealed. The results of Grace’s study 
are similar to those reported by Thompson 
and Hunnicutt (8) in which they found a relation- 
ship between response to praise and blame and 
the personality characteristics of introversion 
and extroversion. The relationship between per- 
sonality characteristics and responses to approv- 
al and disapproval needs to be explored more 
adequately in future investigations. As a work- 
ing hypothesis the writers believe that children’s 
psychological growth is observably affected by 
the social-incentive conditions which they typ- 
ically experience. Since children spend a fair- 
ly large portion of their waking hours in a 
classroom situation where a large share of 
status rewards are governed by the teacher, it 
seems important to investigate the patterns of 
teacher approval-disapproval found in different 
classrooms. It is also the writers’ belief that 
an understanding and observant teacher will 
generate social-incentive atmospheres in 
which approval and disapproval are more or less 
homogeneously distributed among the various 
members of her group. The teacher less sens- 
itive to the mental hygiene needs of children 
will probably be evaluated by her pupils as 
having ‘‘pets’’ who receive a disapproportion- 
ately high amount of commendation and as hav- 
ing less favored pupils who receive the bulk of 
her disapproval. Hence it may be that this scale 
of teacher approval-disapproval will be useful 





in evaluating the success of prospective teachers 
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as well as in estimating the probable behavior of 
children living under various patterns of social 
incentives. 


Development of the Teacher Approval and 
Disapproval Scales 








The most direct and simple method of develop- 
ing teacher approval and disapproval scales 
would be to present only two descriptions to the 
pupils: ‘‘Here is someone who is always being 
praised by the teacher,’’ and ‘‘Here is someone 
who is always being blamed by the teacher.’’ 
The disadvantage of such a simple approach is 
that children would be likely to nominate class- 
mates for approval or disapproval situations 
on the basis of a general ‘‘halo’’ effect. Another 
possible approach, and the one adopted for this 
study, is to sample a fairly large number of 
commonly occurring situations in which child- 
ren are commended or rebuked by the teacher 
for different types of behavior. It was felt that 
this sampling approach would offset, to some de- 
gree, the influence of the ‘‘halo’’ effect and would 
also help to refresh the child’s memory as to 
some of the situations in which children are 
praised or blamed by their teachers. The be- 
havior descriptions which were selected are, in 
the writers’ opinion, fairly representative of the 
types of situations in which children typically re- 
ceive approval or disapproval from their teach- 
ers. They were selected on the basis of con- 
trolled observation of teacher-child interactions 
in several sixth-grade classrooms. 


Preliminary Scale 





The first teacher approval and disapproval 
scales that were developed consisted of the 
following ten approval and ten disapproval items: 


Teacher Approval 





Here is someone whose work is often pointed 
out as being very neat. 


Here is someone who is often asked by the 
teacher to be a monitor while she is out of 
the room. 


Here is someone who is often called on to 
answer questions about the lesson when vis- 
itors are present. 


Here is someone who is often chosen by the 
teacher to take leading parts in plays and 
assembly programs. 


Here is someone whose work is often put up 
on the bulletin board. 


Here is someone who is often praised for 
good writing on all papers handed in. 
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Here is someone whom the teacher often 


members of your class. Read each state- 
asks to do errands for her. 


ment and write down the names of the person 
whom you think the description fits. Remem- 


Here is someone whom the teacher often lets | ber--- 


stay after school to help her. 

1. Several people may fit one picture. You 
may write down after each description as 
many names as you think belong there. 
(The number of choices was actually lim- 
ited by the amount of space available for 
writing--six lines were provided.) 


Here is someone whom the teacher often 
praises for having good manners. 


Here is someone who is often praised for 
good recitations. 


. The same person may be mentioned for 


Teacher Disapproval more than one word-picture. 





Here is someone whom the teacher often 


. If you cannot think of anyone to match a 
bawls out for whispering. 


particular word-picture, go on to the next 


one. 
Here is someone who is often suspected by 


the teacher when something happens while 


. Write the given and last names of the per- 
she is out of the room. 


sons chosen, not the nicknames or initials. 


Here is someone who often has to stay after 
school to finish the day’s work or because of 
misbehavior. 


. Put you own name down if you think the 
description fits you. 


The results obtained in two typical classrooms 
are presented graphically in Figure 1. An in- 
spection of the data shows that a few children re- 
ceive a high percentage of the nominations for 
teacher disapproval; a similarly small number 
of children receive a high percentage of the nom- 
inations for teacher approval, while the majority 
of the children receive few, if any, nominations 


Here is someone whom the teacher often 
bawls out for picking on someone else. 


Here is someone to whom the teacher often 
speaks because he or she seldom has the 

paper or pencil with which to work, or, be- 
cause he or she is slow in getting them out 


when it is time to begin to work. 


Here is someone whom the teacher often 
bawls out for shooting paper wads, sailing 
paper airplanes, tripping people, punching 
someone, or for disturbing the class in 
some other way. 


Here is someone whom the teacher often 
scolds for talking without raising his or her 


for either teacher approval or teacher disap- 
proval. As discussed later in this report, the 
relationship between nominations for teacher 
approval and teacher disapproval appears to 
vary somewhat from classroom to classroom. 


In an effort to determine the internal consis- 
tency of the preliminary instruments rank cor- 
relation coefficients between the ‘‘odd-even’’ 
items were computed for both the teacher ap- 


hand, or for interrupting others while they 


proval and disapproval scales. These rank cor- 
are talking. 


relation coefficients, presented in Table I, dem- 
onstrate a fairly high degree of internal consis- 


Here is someone who seldom has his or her 
work put up on the bulletin board. 


Here is someone who is often bawled out by 
the teacher because he or she pays little 


attention to what is going on during the class 
period. 


Here is someone who is often scolded by the 
teacher for handing in untidy papers or be- 
cause his or her writing is so poor. 


tency among the various items within each of 
the scales. There appears to be somewhat 
greater communality among the items making 
up the disapproval inventory. An analysis of 
individual items showed that some of the ap- 
proval and disapproval situations elicited a 
very small number of nominations. There was 
an apparent need to delete or revise some of 
the items, and to add others. 


Development of Revised Scales 





The revised scales in final form consisted of 
the following twelve teacher approval items and 
twelve teacher disapproval items: 


These twenty items were ‘‘scrambled’’ and 
presented with the following instructions to 
four sixth-grade classrooms in a large urban 
center: 


In this booklet are some word pictures of 





Teacher Approval 





Here is someone whose work is often put up 
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Figure 1 


Showing the distribution of teacher approval-disapproval in two sixth-grade class- 
rooms located in a large urban center. Note that a small number of children receive 
a high percentage of the nominations for teacher approval; also that a small number 
of children receive a high percentage of the nominations for teacher disapproval. It 
may also be noted that a fairly large number of children receive few, if any, nomina- 
tions for either teacher approval or teacher disapproval. 
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TABLE I 


RANK CORRELATION COEFFICIENTS BETWEEN OD- 
EVEN ITEMS ON THE PRELIMINARY TEACHER 
APPROVAL AND DISAPPROVAL SCALES* 





Number of Prel Scales 





Pupils Approval Disapproval 





31 67 -90 
36 71 87 
30 -66 82 


20 -83 -63 














*Data obtained from four sixth-grade classrooms in a 
large urban center. 


TABLE fl 


RANK CORRELATION COEFFICIENTS BETWEEN SCORES 

SCORES OBTAINED BY TEACHER APPROVAL AND DIS- 

SCALES ON TWO OCCASIONS SEPARATED BY A FIVE- 
WEEK INTERVAL* 





Number of Preliminary Scales 





Pupils Approval Disapproval 





I 38 -18 88 











bat 29 81 -86 





*Data obtained from two sixth-grade classrooms in a 
suburban school system. 


TABLE I 
RANK CORRELATION COEFFICIENTS BETWEEN ODD- 


EVEN ITEMS ON THE REVISED TEACHER APPROVAL 
AND DISAPPROVAL SCALES* 








Number of Revised Scales 





Pupils Approval Disapproval 





1 37 
2 33 
3 32 
4 31 














*Data obtained from four sixth-grade classrooms in 
a small city. 
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on the bulletin board. 


Here is someone whose work is often pointed 
out as being very neat. 


Here is someone who is often praised for 
good writing on all papers handed in. 


Here is someone whom the teacher often 
chooses to help other children with their 
work. 


Here is someone who is praised for always 
having their assignments done on time. 


Here is someone who has been praised by 
the teacher because his or her behavior has 
improved. 


Here is someone whom the teacher praises 
for trying hard, even if the work isn’t easy 
for him or her. 


Here is someone who is praised for doing 
extra work, or for bringing things to school 
for the class to see. 


Here is someone on whom the teacher calls 
when she wants the right answer. 


Here is someone who is praised for helping 
keep the room clean and orderly. 


Here is someone who is chosen by the 
teacher to take leading parts in plays and 
assembly programs. 


Here is someone whom the teacher often 
asks to do errands for her or to be a mon- 
itor while she is out of the room. 


Teacher Disapproval 





Here is someone whom the teacher often 
scolds for whispering. 


Here is someone whom the teacher often 
scolds for disturbing the class in some way 


(shooting paper wads, chewing gum, etc.). 


Here is someone who is often scolded by the 
teacher because he or she pays little atten- 
tion to what is going on in class. 


Here is someone who has to stay in the room 
during recess or after school to finish their 
work. 


Here is someone who is often scolded by the 
teacher for handing in untidy papers. 


Here is someone who is often suspected by 
the teacher when something happens while 
she is out of the room. 
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Here is someone whom the teacher often 
scolds for talking without raising his or 
her hand. 


Here is someone whom the teacher often 
points out as wasting too much time. 


Here is someone who is often pointed out as 
not doing their best work. 


Here is someone whom the teacher often 
scolds for starting fights or picking on some- 
one else. 


Here is someone to whom the teacher often 
speaks because they do not have their things 
ready when it is time to begin to work. 


Here is someone who sometimes has to go 
to the principal’s office or has to be punished 
by the teacher. 


Six of the original teacher approval items were 
retained in their exact or slightly revised form 
and six new items were added. Similarly in the 
teacher disapproval scale, nine of the original 
items were retained in their exact or slightly 
modified form and three new items were added. 
The instructions for administering the revised 
scale were similar to those adopted for the 
preliminary scale. In the revised scale the 
number of nominations for each item was lim- 
ited by the amount of space available for writing 
--only four lines were provided, whereas six 
lines were available in the preliminary form. 
It was believed that this revision might ‘‘sharpen’’ 
the discriminatory power of the scales. Since 
some overly-conscientious children demon- 
strate an apparent need tp use all of the avail- 
able space, they may continue to add names 
which do not necessarily fit the word-descrip- 
tion. The only other change in the directions 
was to instruct the child that he could not nom- 
inate himself for any of the situations. These 
self-nominations, which were permitted in the 
preliminary form, undoubtedly provide addition- 
al and useful information about children, but it 
appears obvious that they would not be distrib- 
uted along the same psychological continuum 
as nominations of fellow classmates. 


The revised teacher approval and disapproval 
scales were administered to two classrooms 
in a suburban school system on two different 
occasions, separated by an interval of five 
weeks. The data obtained from the first admin- 
istration of the teacher approval and disapproval 
scales are presented graphically in Figure 2. 
These data appear to be quite similar to those 
obtained with the preliminary scales (Fig. 1), 
in that they show a very small number of 
children receiving a high percentage of the 
teacher approval nominations, a few pupils re- 
ceiving most of the teacher disapproval nom- 
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Figure 2 


Showing the distribution of teacher approval-disapproval in two sixth-grade class- 
rooms located in a suburban school system. Note that a small number of children 
receive a high percentage of the nominations for teacher approval; also that a small 
number of children receive a high percentage of the nominations for teacher disap- 
proval. It may be noted that a fairly large number of children receive few, if any, 
nominations for either teacher approval or teacher disapproval. The distribution of 
teacher approval-disapproval in Classroom II, which is a classroom of children of 
below-average intelligence, is somewhat restricted in total range. 
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inations, and the majority receiving few, if any, 
nominations for either teacher approval or 
disapproval. It is interesting to note th>+ Class- 
room II, which enrolled children of lower men- 
tal ability, reflects the same trend in the dis- 
tribution of teacher approval-disapproval, but 
is significantly restricted in range of scores. 
The conditions that effect this restriction in 
range of scores may reflect the attitudes of the 
teacher as well as the mental status of the 
pupils. 


The stability of pupil rankings over the five- 
week interval with respect to teacher approval 
and disapproval is shown in Table II. The con- 


sistency of the teacher disapproval scale appears 


to be greater than that of the teacher approval 
scale. These rank correlation coefficients may 
be interpreted as indicating a fairly high amount 
of pupil consistency in nominating their class- 
mates to fill teacher approval and teacher dis- 
approval descriptions. These rank correlation 
coefficients are fairly satisfactory for tests 
separated by a five-week interval, since it is 
probable that some real changes in rank, which 
would be revealed by a ‘‘perfect’’ measuring 
instrument, occurred during this five-week in- 
terval. 


The revised teacher approval and disapproval 
scales were administered to four sixth-grade 
classrooms in a small city to provide data for 
an examination of the individual items. Rank 
correlation coefficients of the ‘‘odd-even’’ 
items were computed for the revised teacher 
approval and teacher disapproval scales. These 
correlation coefficients, presented in Table ITI 
are very similar to those obtained from the pre- 
liminary scales. In the revised as well as in 
the preliminary instruments there appears to 
be somewhat greater communality among the 
items comprising the teacher disapproval 
scale. One might conjecture that it is easier 
for children to identify individuals receiving 
teacher disapproval. An alternative hypothesis 
which one should also consider is that it is 
easier to construct items which sample the 
total population of teacher disapproval situa- 
tions. The revised scales appear fairly satis- 
factory as to internal consistency of items and 
test-retest stability. 


Psychological Characteristics of Children 
Experiencing Various Patterns of Teacher 
Approval and Teacher Disapproval 











After developing scales of teacher approval 
and teacher disapproval that appeared to be 
fairly satisfactory with respect to internal con- 
sistency and test-retest stability, the writers 
decided to study some of the psychological 
characteristics of children who might exper- 
ience the following patterns of teacher approval- 


(Vol. XVI) 


disapproval: 


1. Children receiving large amounts of ap- 
proval and little disapproval. 


- Children receiving large amounts of ap- 
proval and a moderate amount of disap- 
proval. 


. Children receiving little or no approval 
or disapproval. 


. Children receiving large amounts of dis- 
approval and a moderate amount of ap- 
proval. 


- Children receiving large amounts of dis- 
approval and little approval. 


As a first approximation one can consider these 
patterns of teacher approval-disapproval as ad- 
joining areas in a two-dimensional plane of 
teacher approval-disapproval. Thus in a sense, 
these five patterns constitute a two-dimensional 
*‘area-continuum.’’ An individual child’s scores 
on both teacher approval and disapproval would 
determine his inclusion in one of the five surface 
patterns. Since the above procedure is merely 
an approximation adopted for experimental anal- 
ysis a number of children may not be included 
in any of the five surface patterns. It was be- 
lieved by the writers that the pattern approach 
adopted for this analysis would reveal relation- 
ships between psychological characteristics of 
children and social-incentive conditions in the 
classroom that a correlational approach with 
the isolated variables, teacher approval and 
teacher disapproval, might not reveal. 


This phase of the investigation was explora- 
tory in a very real sense. It was designed as 
an experiment to test some of the more ob- 
vious hypotheses which come to mind: (1)Bright 
children receive more teacher approval than 
disapproval, whereas dull children experience 
more disapproval than approval from their 
teachers; (2) Children who are achieving above 
the class average receive more approval than 
disapproval, and vice versa for children below 
the class average; (3) children who receive 
little teacher attention, either approval or dis- 
approval, are likely to be withdrawn and re- 
tiring. It is possible to develop other hypotheses 
similar to the ones mentioned above. In this 
phase of the investigation it was decided to study 
the intelligence level, the achievement level, and 


the personality adjustment of children experienc- 
ing fairly clear-cut patterns of teacher approval- 


disapproval. Four classrooms located ina 
small city in upstate New York were selected 
for this phase of the investigation. The follow- 
ing tests were administered to the 133 pupils in 
these four classrooms: (1) Teacher Approval- 
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Disapproval Scales; (2) California Test of Men- 
tal Maturity, Short Form; (3) Progressive 
Achievement Tests; and (4) California Test of 
Personality. The means and standard deviations 
obtained with the standardized tests in each of 
the four classrooms are presented in Table IV. 


The data obtained with the teacher approval 
and disapproval scales are presented graphically 
in Figure 3. The data appear to be quite sim- 
ilar to those obtained in the preliminary phases 
of this study (Figs. 1 and 2) in that they show a 
few children receiving most of the nominations 
for teacher approval, a very small number re- 
ceiving most of the nominations for teacher dis- 
approval, and the bulk of the children receiving 
few, if any, nominations for either teacher ap- 
proval or disapproval. 


For the present study an attempt was made to 
isolate those children experiencing each of the 
five social-incentive patterns listed above. 
There appeared to be no rational or entirely 
clear-cut empirical method of isolating these 
children experiencing different patterns of 
teacher approval-disapproval. The following 
rule-of-thumb procedures were finally adopted 
to isolate within a given classroom those child- 
ren experiencing the five patterns of teacher 
approval -disapproval: 


1. Those pupils in the upper quartile of 
teacher approval nominations and falling 
at or below the median of disapproval 
nominations were designated as the high 
approval-low disapproval group (HA-LD). 


. Those pupils in the upper quartile of 
teacher approval nominations and falling 
above the median of disapproval nomina- 
tions were designated as the high approval- 
medium disapproval group (HA-MD). 


. Those pupils falling at or below the med- 
ian for both teacher approval and disap- 
proval nominations were designated as the 
low approval-low disapproval group (LA- 
LD). 


- Those pupils in the upper quartile of 
teacher disapproval nominations and fall- 
ing above the median of approval nomin- 
ations were designated as the high disap- 
proval-medium approval group (HD-MA). 


. Those pupils in the upper quartile of 
teacher disapproval nominations and fall- 
ing at or below the median of approval 
nominations were designated as the high 
disapproval-low approval group (HD-LA). 


The children included in the HA-LD pattern in 
each of the four classrooms were combined into 
aHA-LD group. The same procedure for com- 


bining subjects from the four classrooms was 
adopted for the other four patterns of teacher 
approval-disapproval. The distributions of 
teacher approval-disapproval for the five social- 
incentive patterns are presented graphically in 
Figure 4. It may be seen that there is a high de- 
gree of similarity between some of the points in 
the HA-LD and the HA-MD groups as well as be- 
tween some of the points in the HD-MA and the 
HD-LA groups. If a much larger population of 
children had been available the rule-of-thumb 
procedures listed above could have been adjusted 
so that sharper differentiation between these ad- 
joining groups might have been effected. 


A point of interest which certainly needs further 
investigation is the differential distribution of 
teacher approval and teacher disapproval nomin- 
ations in the different classrooms. 


In Classroom 1 those four pupils included 
in the HA-LD classification received 36 per- 
cent of the 916 nominations for the twelve 
teacher approval items. In this same class- 
room the four pupils included in the HD-LA 
group received 24 percent of the 780 nomin- 
ations for the twelve teacher disapproval 
items. 


In Classroom 2 the four pupils in the HA- 
LD group received 39 percent of the 635 
teacher approval nominations. The three 
pupils in the HD-LA group received 39 per- 
cent of the 622 teacher disapproval nomin- 
ations. 


In Classroom 3 the six pupils in the HA- 
LD group received 64 percent of the 805 
teacher approval nominations. The four 
pupils in the HD-LA group received 37 per- 
cent of the 936 teacher disapproval nomina- 
tions. 


In Classroom 4 the seven pupils in the HA- 
LD group received 72 percent of 886 teacher 
approval nominations. The five pupils in the 
HD-LA group received 39 percent of the 
1122 teacher disapproval nominations. 


It can be seen from these percentages that a 
very small number of children receive from 
36 to 72 percent of the total nominations for 
approval items. -It seems reasonable to believe 
that this variation in percentages from class- 
room to classroom may be a meaningful vari- 
able for defining the teacher approval-disap- 
proval structure of particular classrooms. ‘It 
can likewise be seen that a very small number 
of pupils receive from 24 to 39 percent of the 
total nominations for disapproval items. This 
variation in percentages may also provide 
useful data along the lines indicated above. 





It will be noted that only 86 of the total avail- 
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Figure 3 


Showimg the distribution of teacher approval-disapproval in four sixth-grade class- 
rooms located in a small city. Note that a small number of children receive a high 
percentage of the nominations for teacher approval; also that a small number of chil- 
dren receive a high percentage of the nominations for teacher disapproval. It may 
also be noted that a fairly large number of children receive few, if any, nominations 
for either teacher approval or teacher disapproval. 
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Figure 4 


Showing the distribution of teacher approval and disapproval in the five groups 
selected for experimental comparisons. HA-LD indicates high approval-low 
disapproval; HA-MD indicates high approval-moderate disapproval; LA-LD in- 
dicates low approval-low disapproval; HD-MA indicates high disapproval-mod- 
erate approval; HD-LA indicates high disapproval-low approval. 
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able population of 133 pupils were employed 
for investigating the psychological character- 
istics of children experiencing different pat- 
terns of teacher approval-disapproval. Some 
of the cases were omitted because of incom- 
plete test information due to absences, while 
other cases were discarded because they did 
not satisfy the selection criteria for any of the 
five patterns of teacher approval-disapproval. 


The means and the standard deviations of the 
California Mental Maturity, the Progressive 
Achievement and the California Personality 
Test scores for each of the five approval-dis- 
approval groups are presented in Table V. It 
may be seen by inspecting this table that 
there are some fairly large differences among 
the standard deviations obtained for the five 
social-incentive groups on a particular test. 
One may note in particular the relatively large 
standard deviation of the HA-LD group on 
Achievement with respect to achievement 
scores of the other groups. Because of this 
apparent lack of homogeneity of variance 
among the five approval-disapproval groups 
on certain of the test scores, and because the 
N’s were relatively small it was decided to 
apply the F-test for homogeneity of variance 
before testing the significance of differences 
between means.! The results of the F-tests 
are presented in Table VI. 


These results may be interpreted as follows: 


1. The five approval-disapproval groups 
with respect to mental maturity appear 
to be homogeneous as to variability, i.e. 
the null hypothesis cannot be rejected 
for any of the ten comparisons at either 
the one percent or five percent levels 
of confidence. 


. The variability of achievement scores 
appears to be significantly higher for the 
HD-LA group than for the HA-LD, HA- 
MD, and LA-LD groups, i.e. the null 
hypothesis can be rejected at either the 
one percent or five percent levels. 


3. The variability of the total scores obtained 
on the California Test of Personality ap- 
pears to be somewhat higher for the LA- 
LD group and somewhat lower for the HA- 
LD group, at least in some of the compar- 
isons. The difference in variability be- 





tween HA-LD and HD-LA is significant at 
the five percent level. The variability of 
the LA-LD group is significantly different 
at the five percent level from both the HA- 
LD and the HA-MD groups. 


The variability of the ‘‘self’’ scores of the 
personality test appears to be relatively 
homogeneous for the five groups, i.e. the 
null hypothesis cannot be rejected at 
either the one percent or five percent 
levels for any of the comparisons, 


The variability of the ‘‘social’’ scores on 
the personality test appears to be some- 
what higher for the HD-LA group and 
somewhat lower for both the HA-LD and 
HA-MD groups. The variability of the 
HA-LD group is significantly lower than 
that of the HD-LA, HD-MA, or LA-LD 
groups. The variability of the HA-MD 
group is significantly lower than that of 
the HD-LA, HD-MA, and LA-LD groups. 


The psychological significance of some of 
these differences in variability is considered 
in the section on differences between means 
which follows directly below. The significant 
differences in variability existing among some 
of the group comparisons place certain limita- 
tions on the interpretation of differences be- 
tween mean scores, which are presented in 
Table VII. It will be noted that in this table the 
results of applying the t-test are presented in 
parentheses for those comparisons of groups 
in which there is a significant difference in vari- 
ability at either the one or five percent levels of 
confidence. When the scores of two groups are 
not homogeneous with respect to variability, the 
interpretation of the results of a t-test cannot 
be made in a rigorous fashion.2 The t’s pre- 
sented in Table VII may be interpreted as 
follows: 


1. With respect to mental maturity the HA- 
LD group is significantly more intelli- 
gent than the LA-LD and the HD-LA 
groups.3 The HA-MD group is signifi- 
cantly more intelligent than the LA-LD, 
the HD-MA, and the HD-LA groups. This 
finding may be interpreted as supporting 
the hypothesis suggested previously in 
this report, that the more intelligent 
children in a typical classroom receive 
a significantly larger amount of teacher 











. Snedecor, G. ¥, Statistical Methods (Ames, Iowa: Collegiate Press, 1939), 


D. 249, 


. Lindouist, E. F. Statistical Analysis in Educational Research (Chicago: 
Houghton Mifflin, 1949), p. 56. 


. As indiceted in the legend to Table VII, only those t's which permit one 
to reject the null hypothesis at either the one or five percent levels 
of confidence are considered "significant." 
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approval, as seen by their classmates, 
while the less intelligent pupils exper- 
ience larger amounts of teacher dis- 
approval. 


2. In academic achievement the mean score 
of the HA-LD group is significantly high- 
er than the HD-MA group. The HA-MD 
group obtained a mean score significantly 
higher than the LA-LD and HD-MA groups. 
This may be interpreted as indicating that 
those children who show evidence of hav- 
ing acquired more subject matter knowl- 
edge receive a higher proportion of 
teacher approval, at least as judged by 
their classmates. 


3. The means of the total scores on the 


California Test of Personality follow a 
similar trend. The HA-LD group is sig- 
nificantly ‘‘better adjusted’’ than the 
HD-MA group. Th HA-MD group shows 
‘‘superior adjustment’’ to the HD-MA 

and the HD-LA groups. At this point it 
may be well to recall that the total score 
on the California Test of Personality rep- 
resents the child’s ‘‘Adjustment’’ as he 
sees it, an ‘‘introspectionistic’’ evalua- 
tion. In view of the nature of this test it 
appears to the writers that the results 
obtained in this study with respect to 
teacher approval and teacher disapproval 
are very provocative. One might expect 
the traditional teacher to award more 
approval to the brighter children who do 
better work. However, it seems extreme- 
ly doubtful that the average teacher has a 
great deal of knowledge of, or is in any 
significant way influenced by, the child’s 
evaluation of his own ‘‘adjustment.’’ On 
the basis of these findings, it appears that 
the child’s evaluation of his own adjust- 
ment may be influenced by his level of 


achievement and/or the teacher’s approval- 


disapproval attitudes toward him. The 
direction and nature of any cause-effect 
relationship that may exist between per- 
sonality adjustment and teAcher approval- 
disapproval remains an interesting prob- 
lem for future investigation. A develop- 
mental study, longitudinal in design, 
should throw some light on the question 
of whether those pupils who live in a pre- 
dominantly teacher disapproval atmos- 
phere become progressively more ‘‘poor- 
ly adjusted.’’ The present study provides 
only a hint that this may be the actual 
state of affairs. 


4. The mean ‘‘self’’ scores for the five ap- 
proval- disapproval groups onthe Cali- 
fornia Test of Personality may be interpre- 
ted ina similar way to the total scores pre- 
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sented above. This similar finding 

might be expected in part since the 
‘‘self’’ scores are included as a part of 
the total scores. As far as ‘‘self adjust- 
ment’’ is concerned the HA-LD group is 
superior to the HD-LA and HD-MA groups. 
There is one difference in interpretation 
between the ‘‘self’’ and total scores. The 
HA-MD group shows significantly better 
‘‘self adjustment’’ than any of the other 
four groups. With the present knowledge 
available the writers are not in a position 
to explain why that group, which is highly 
praised but which also receives a moder- 
ate amount of disapproval, shows signif- 
icantly better ‘‘self adjustment’’ than 

that group which also receives a high 
amount of approval and little disapproval. 
One might hazard a guess that children 
who receive large amounts of approval 
and essentially no disapproval may be 
operating under such intense pressure to 
conform to social demands as to be act- 
ually handicapped in areas of ‘‘self ad- 
justment.’’ There is a very slight amount 
of evidence in the present data to support 
such a hypothesis in the fact that the HA- 
LD group receives a higher mean score 
in ‘‘social adjustment’’ than does the HA- 
MD group (see conclusion number 5, be- 
low). This hypothesis needs to be exam- 
ined critically by future studies of this 
type. In conclusion, it may be stated with 
a high degree of confidence that groups of 
children who receive high amounts of 
teacher approval demonstrate ‘‘self ad- 
justment’’ that is superior to groups of 
children who experience excessive teacher 
disapproval. 


As shown in Table V on mean scores, 
‘*social’’ scores on the California Test of 
Personality decline fairly regularly with 
progression from the teacher approval to 
the teacher disapproval end of the ‘‘con- 
tinuum.’’ The differences in the means 

of the ‘‘social’’ scores are somewhat 
difficult to interpret in a rigorous way be- 
cause of the apparent lack of homogeneity 
of variance among the five approval-dis- 
approval groups. In those groups in which 
the variability of scores appears to be 
homogeneous the obtained t’s may be in- 
terpreted as showing that those groups ex- 
periencing large amounts of teacher ap- 
proval demonstrate superior ‘‘social ad- 
justment.’’ Not only do the groups re- 
ceiving large amounts of approval have 
higher mean scores in ‘‘social adjustment”’ 
but they also show significantly less vari- 
ability in ‘“‘social adjustment’’ scores. 


As an overall evaluation of the relationship be- 
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tween teacher approval-disapproval patterns 
and psychological characteristics of children 

it may be stated, as a first approximation, 

that children who are considered by their class- 
mates to be experiencing a high degree of 
teacher approval coupled with a low or mod- 
erate degree of disapproval are more intelli- 
gent, are better students, and are better ad- 
justed than their less favored classmates. 
Whatever cause-effect relationship may exist 
between teacher approval and superior psychol- 
ogical adjustment cannot be determined in any 
precise way by the data at hand. However, one 
gets some hints that children who experience 

a high degree of teacher approval have a better 
opinion of themselves, are more ‘‘outgoing,’’ 
and have more confidence in their ability to 
adjust to social situations. 


Discussion 


The scales of teacher approval and of teacher 
disapproval which have been developed in this 
investigation have interesting possibilities in a 
number of areas. These instruments should 
prove helpful in conducting further research of 
the type described in the second phase of the 
present study. It is obvious that more work 
needs to be done to relate the results obtained 
with these scales to social-incentive conditions 
in the classroom as an impartial adult judge 
would evaluate them over a long period of ob- 
servation. It might also be fruitful to determ- 
ine the relationship between the results obtained 
with these scales and social-incentive conditions 
in the classroom as the teacher sees them. It 
appears entirely possible that the scales of 
teacher approval and disapproval might be re- 
vised so that they would be more adequate in 
terms of reliability to measure the social-in- 
centive conditions experienced by individual 
students. However, it seems desirable to the 
writers to ascertain the general utility of the 
scales before attempting further refinement. 
Some of the possible uses of these scales are as 
follows: 


1. Data of this type might be used to measure 
one aspect of teacher effectiveness in pro- 
moting the general psychological develop- 
ment of pupils. In this way, these instru- 
ments would be useful to teacher-prepar- 
ation programs and to in-service teacher 
study groups. 


2. Data obtained with these scales might 
provide additional information for predict- 
ing the psychological development of chil- 
dren experiencing a certain pattern of 
teacher approval-disapproval. 


The results of the second phase of the present 
investigation are interesting since they pose 
many different hypotheses which should be more 
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adequately tested by further research. Some of 
these hypotheses are as follows: 


1. There are individual differences among 
teachers in generating social-incentive 
atmospheres of different types. In some 
classrooms a small number of children 
receive a disproportionately high per- 
centage of teacher approval. A similar 
condition exists for teacher disapproval. 


These results appear to reflect values 
that are common to our culture, e.g., 
those children who tend to be more in- 
telligent and to obtain better grades in 
school receive most of the teacher 
praise. 


. There appears to be some evidence that 
children experiencing large amounts of 
teacher disapproval evaluate their per- 
sonal worth and general social adjust- 
ment as being lower in status than that 
of their classmates. 


There is also a hint in the data at hand 
that the children who receive large 
amounts of teacher approval and little 
teacher disapproval may be under con- 
siderable social pressure to conform to 
the conditions for good conduct imposed 
by the teacher. 


It should be noted that these hypotheses, which 
seem reasonable to the writers, require much 
additional study before conclusions can be drawn 
at a reasonably high level of confidence. At the 
present time a longitudinal study is under way 
which attempts to throw additional light on some 
of the problems raised in this investigation. 


Summary 


The purpose of this investigation was to devel- 
op scales of teacher approval and teacher dis- 
approval of the ‘“‘Guess Who?’’ type and to de- 
termine some of the psychological character- 
istics of children living under more-or-less 
clear-cut patterns of teacher approval-disap- 
proval. The scaling procedure devised for this 
study consisted simply of presenting a brief and 
relatively unstructured description of some be- 
havior that could be attributed to any member 
of a group and then asking the various members 
of this group to nominate one or more of their 
associates whose behavior most nearly fits the 
description. 


The internal consistency of the items on the 
preliminary teacher approval and teacher dis- 
approval scales was found to be fairly satis- 
factory for tests of this type. The stability of 
the pupil ratings on the revised teacher ap- 
proval and teacher disapproval scales over a 
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five-week period was found to be quite adequate | between teacher approval-disapproval patterns 
for group comparisons--correlation coefficients | and psychological characteristics of children 
approximating .80 were obtained. The data ob- | it may be stated, as a first approximation, that 
tained with the teacher approval and teacher children who are considered by their classmates 
disapproval scales show that four or five pupils | to be experiencing a high degree of teacher ap- 
ina typical classroom of 30-35 children receive | proval coupled with a low or moderate degree 
from 35-70 percent of the total nominations for | of disapproval are more intelligent, are better 
teacher approval, while four or five different students, and are better adjusted than their less 
pupils in the same classroom receive 25-40 per-| favored classmates. Whatever cause-effect 

cent of the nominations for teacher disapproval. | relationship may exist between teacher approval 
The distributions of teacher approval and dis- and superior psychological adjustment cannot be 
approval for several classrooms are presented | determined in any precise way by the data at 

in graphic form. hand. However, one gets some hints that child- 


ren who experience a high degree of teacher ap- 
The California Tests of Mental Maturity and proval have a better opinion of themselves, are 
pence ht mart ny es -y y t more ‘‘outgoing,’’ and have more confidence in 
tests we their ability to adjust to social situat " 
classrooms. The children in these classrooms 7 - - fons 
divided into five more-or-less non-over- 
pving groups with respect > patterns of Some of the possible uses in further research 
teacher approval-disapproval. The results of and practical application of the teacher approval 
an analysis of the psychological characteristics and teacher disapproval scales are discussed. 
of the children included in these five groups are | S¢veral hypotheses about the psychological de- 
presented in tabular form. Results of various velopment and adjustment of children experienc- 
statistical analyses are reported in detail. ing different patterns of teacher approval and 


disapproval are presented with suggestions for 
As an overall evaluation of the relationship further research. 
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